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PDE5A CRYSTAL STRUCTURE AND USES 

CROSS-REFERENCE TO RELATED PATENT APPLICATIONS 

[0001] This application claims the benefit of Milburn, U.S. Provisional Application 
60/444,734, filed February 3, 2003 and Artis et al., U.S. Provisional Application 
60/485,627, filed July 7, 2003, all of which are incorporated herein by reference in their 
entireties, including drawings. 

BACKGROUND OF THE INVENTION 

[0002] This invention relates to the field of development of ligands for 
phosphodiesterase 5 A (PDE5A) and to the use of crystal structures of PDE5A. The 
information provided is intended solely to assist the understanding of the reader. None of 
the information provided nor references cited is admitted to be prior art to the present 
invention. 

[0003] PDEs were first detected by Sutherland and co-workers (Rail, et al., J. Biol. 

Chem., 232:1065-1076 (1958), Butcher, et al., J. Biol Chem., 237:1244-1250 (1962)). 

The superfamily of PDEs is subdivided into two major classes, class I and class II 

(Charbonneau, H., Cyclic Nucleotide Phosphodiesterases: Structure, Regulation and Drug 

Action, Beavo, J., and Houslay, M.D., eds) 267-296 John Wiley & Sons, Inc., New York 

(1990)), which have no recognizable sequence similarity. Class I includes all known 

mammalian PDEs and is comprised of 1 1 identified families that are products of separate 

genes (Beavo, et al., Mol. Pharmacol,, 46:399-405 (1994); Conti, et al., Endocr. Rev., 

16:370-389 (1995); Degerman, et al., J. Biol Chem., 272:6823-6826 (1997); Houslay, 

M.D.,Adv. Enzyme Regul, 35:303-338 (1995); Bolger, G.B., Cell Signal, 6:851-859 

(1994); Thompson, et al, Adv. Second Messenger Phosphoprotein Res., 25:165-184 

(1992); Underwood, et al., J. Pharmacol. Exp. Ther., 270:250-259 (1994); Michaeli, et al., 

J. Biol. Chem., 268:12925-12932 (1993); Soderling, et al., Proc. Natl Acad Sci. U.S.A., 

95:8991-8996 (1998); Soderling, et al., J. Biol. Chem., 273:15553-15558 (1998); Fisher, et 

al., J. Biol Chem., 273:15559-15564 (1998)). Some PDEs are highly specific for 
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hydrolysis of cAMP (PDE4, PDE7, PDE8), some are highly cGMP-specific (PDE5, 
PDE6, PDE9) 5 and some have mixed specificity (PDE1, PDE2, PDE3, PDE10). 

[0004] All of the characterized mammalian PDEs are dimeric, but the importance of the 
dimeric structure for function in each of the PDEs is unknown. Each PDE has a conserved 
catalytic domain of -270 amino acids with a high degree of conservation (25-30%) of 
amino acid sequence among PDE families, which is located carboxyl-terminal to its 
regulatory domain. Activators of certain PDEs appear to relieve the influence of 
autoinhibitory domains located within the enzyme structures (Sonnenberg, et al., J. Biol. 
Chem., 270:30989-31000 (1995); Jin, et al., J. Biol Chem., 267:18929-18939 (1992)). 

[0005] PDEs cleave the cyclic nucleotide phosphodiester bond between the phosphorus 
and oxygen atoms at the 3 '-position with inversion of configuration at the phosphorus 
atom (Goldberg, et al., J. Biol Chem., 255:10344-10347 (1980); Burgers, et al., J. Biol 
Chem., 254 :9959-9961 (1979)). This apparently results from an in-line nucleophilic attack 
by the OH— of ionized H2O. It has been proposed that metals bound in the conserved 
metal binding motifs within PDEs facilitate the production of the attacking OH— (Francis, 
et al., J. Biol Chem., 269:22477-22480 (1994)). The kinetic properties of catalysis are 
consistent with a random order mechanism with respect to cyclic nucleotide and the 
divalent cations(s) that are required for catalysis (Srivastava, et al., Biochem. J., 308 :653- 
658 (1995)). The catalytic domains of all known mammalian PDEs contain two sequences 
(ELY3 HX n (E/D)) arranged in tandem, each of which resembles the single Zn 2+ -binding site 
of metalloendoproteases such as thermolysin (Francis, et al., J. Biol Chem., 269 :22477- 
22480 (1994)). PDES specifically binds Zn 2+ , and the catalytic activities of PDE4, PDE5, 
and PDE6 are supported by submicromolar concentrations of Zn 2+ (Francis, et al., J. Biol 
Chem., 269:22477-22480 (1994); Percival, et al., Biochem. Biophys. Res. Commun., 
241:175-180 (1997)). Whether each of the Zn 2+ -binding motifs binds Zn 2+ independently 
or whether the two motifs interact to form a novel Zn 2+ -binding site is not known. The 
catalytic mechanism for cleaving phosphodiester bonds of cyclic nucleotides by PDEs 
may be similar to that of certain proteases for cleaving the amide ester of peptides, but the 
presence of two Zn 2+ motifs arranged in tandem in PDEs is unprecedented. 

[0006] The group of Sutherland and Rail (Berthet, et al., J. Biol Chem., 229:351-361 
(1957)), in the late 1950s, was the first to realize that at least part of the mechanism(s) 
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whereby caffeine enhanced the effect of glucagon, a stimulator of adenylyl cyclase, on 
cAMP accumulation and glycogenolysis in liver involved inhibition of cAMP PDE 
activity. Since that time chemists have synthesized thousands of PDE inhibitors, including 
the widely used 3-isobutyl-l-methylxanthine (IB MX). Many of these compounds, as well 
as caffeine, are non-selective and inhibit many of the PDE families. One important 
advance in PDE research has been the discovery/design of family-specific inhibitors such 
as the PDE4 inhibitor, rolipram, and the PDE5 inhibitor, sildenafil. 

[0007] Precise modulation of PDE function in cells is critical for maintaining cyclic 
nucleotide levels within a narrow rate-limiting range of concentrations. Increases in 
cGMP of 2-4-fold above the basal level will usually produce a maximum physiological 
response. There are three general schemes by which PDEs are regulated: (a) regulation by 
substrate availability, such as by stimulation of PDE activity by mass action after elevation 
of cyclic nucleotide levels or by alteration in the rate of hydrolysis of one cyclic nucleotide 
because of competition by another, which can occur with any of the dual specificity PDEs 
(e.g. PDE1, PDE2, PDE3); (b) regulation by extracellular signals that alter intracellular 
signaling (e.g. phosphorylation events, Ca 2+ , phosphatidic acid, inositol phosphates, 
protein-protein interactions, etc.) resulting, for example, in stimulation of PDE3 activity 
by insulin (Degerman, et al., J. Biol. Chem., 272:6823-6826 (1997)), stimulation of PDE6 
activity by photons through the transducin system (Yamazaki, et al., J. Biol. Chem., 
255 :11619-11624 (1980)), which alters PDE6 interaction with this enzyme, or stimulation 
of PDE 1 activity by increased interaction with Ca 2+ /calmodulin; (c) feedback regulation, 
such as by phosphorylation of PDE 1, PDE3, or PDE4 catalyzed by PKA after cAmP 
elevation (Conti, et al., Endocr. Rev., 16:370-389 (1995); Degerman, et al., J. Biol. Chem., 
272:6823-6826 (1997); Gettys, et al., J. Biol. Chem. 262:333-339 (1987); Florio, et al, 
Biochemistry, 33:8948-8954 (1994)), by allosteric cGMP binding to PDE2 to promote 
breakdown of cAMP or cGMP after cGMP elevation, or by modulation of PDE protein 
levels, such as the desensitization that occurs by increased concentrations of PDE3 or 
PDE4 following chronic exposure of cells to cAMP-elevating agents (Conti, et al., 
Endocr. Rev., 16:370-389 (1995), Sheth, et al., Throm. Haemostasis, 77:155-162 (1997)) 
or by developmentally related changes in PDE5 content. Other factors that could 
influence any of the three schemes outlined above are cellular compartmentalization of 
PDEs (Houslay, M.D., Adv. Enzyme Regul, 35:303-338 (1995)) effected by covalent 
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modifications such as prenylation or by specific targeting sequences in the PDE primary 
structure and perhaps translocation of PDEs between compartments within a cell. 

[0008] Within the PDE superfamily, four (PDE2, PDES, PDE6, and PDE10) of the 10 
families contain highly cGMP-specific allosteric (non-catalytic) cGMP -binding sites in 
addition to a catalytic site of varying substrate specificity. Each of the monomers of these 
dimeric cGMP-binding PDEs contains two homologous cGMP-binding sites of ~1 10 
amino acids arranged in tandem and located in the amino-terminal portion of the protein 
(Charbonneau, H., Cyclic Nucleotide Phosphodiesterases: Structure, Regulation and Drug 
Action, Beavo, J., and Houslay, M.D., eds) 267-296 (1990); McAllister-Lucas, et al., J. 
Biol Chem., 270:30671-30679 (1995)). In PDE2, binding of the cGMP to these sites 
stimulates the hydrolysis of cAMP at the catalytic site (Beavo, et al., Mol Pharmacol., 
46:399-405 (1994)). PDE2 hydrolyzed cGMP as well as cAMP, and cGMP hydrolysis is 
stimulated by cGMP binding at the allosteric sites in accordance with positively 
cooperative kinetics (Manganiello, et al., Cyclic Nucleotide Phosphodiesterases: 
Structure, Regulation, and Drug Action, Beavo, J., andHouslay, M.D., eds, 61-85 John 
Wiley & Sons, Inc., New York (1990)). This could represent a negative feedback process 
for regulation of tissue cGMP levels (Manganiello, et al., Cyclic Nucleotide 
Phosphodiesterases: Structure, Regulation, and Drug Action, Beavo, J., and Houslay, 
M.D., eds, 61-85 John Wiley & Sons, Inc., New York (1990)), which occurs in addition to 
the cross-talk between cyclic nucleotide pathways represented by cGMP stimulation of 
cAMP breakdown. Binding of cGMP to the allosteric sites of PDE6 has not been shown 
to affect catalysis, but this binding may modulate the interaction of PDE6 with the 
regulatory protein, transducin, and the inhibitory y subunit of PDE6 (Yamazaki, et al., 
Adv. Cyclic Nucleotide Protein Phosphorylation Res., 16:381-392 (1984)). 

[0009] The first recognized cGMP -binding PDE was discovered as a cGMP-binding 
protein in lung tissue during a search for cyclic nucleotide-binding proteins other than 
cyclic nucleotide-dependent protein kinases (Lincoln, et al., Proc. Natl Acad. Sci U.S.A., 
73:2559-2563 (1976)). By DEAE-cellulose chromatography, this protein appeared as a 
"peak 1" cGMP-binding protein that was separated from a "peak 2" cGMP-binding 
protein, which was shown to be PKG. The peak 1 protein possessed both cGMP -binding 
as well as a distinct cGMP-specific PDE catalytic activity (Francis, et al., J. Biol. Chem., 
255:620-626 (1980)), and it was subsequently named PDE5. Davis and Kuo (Davis, et al., 
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J. Biol Chem., 252:4078-4084 (1977)) also described a cGMP-specific PDE activity in 
lung tissue, and Hamet and Coquil (Hamet, et al., J. Cyclic Nucleotide Res,, 4:281-290 
(1978)) characterized a cGMP-binding, cGMP-specific PDE in platelets. 

[0010] PDE5 has been purified and cloned (Francis, et al, J. Biol. Chem.. 255:620-626 
(1980); Francis, et al., Methods Enzymol, 159:722-729 (1988); Thomas, et al., J. Biol 
Chem., 265:14964-14970 (1990); McAllister-Lucas, et al., J. Biol Chem., 268:22863- 
22873 (1993)). Two alternatively spliced variants of PDE5 have recently been identified 
(Yanaka, et al., Eur. J. Biochem., 255:391-399 (1998); Loughney, et al., Gene (Amst.)., 
216:137-147 (1998)). The tissue distribution of PDE5 (subunit M r -100,000) commonly 
coincides with that of PKG. This is probably not fortuitous because both PDE5 and PKG 
are major intracellular receptors for cGMP, and PKG is an excellent catalyst in vitro for 
phosphorylation of PDE5 (Thomas, et al., J. Biol Chem., 265:14971-14978 (1990)). 

[0011] Evidence regarding the presence of conserved Zn 2+ -binding motifs (HX3 
HX n (E/D)) in PDEs and their involvement in catalysis was first demonstrated using PDE5 
(Francis, et al., J. Biol. Chem., 269:22477-22480 (1994)). Site-directed mutagenesis 
confirms the catalytic importance of each residue of these motifs A and B (Turko, et al., J. 
Biol. Chem., 273 :6460-6466 (1998)). Substitution of either of the invariant aspartic acid 
residues (Asp-714, Asp-754) further downstream in the sequence is also highly 
deleterious, and each of these residues may participate in the catalytic process perhaps as a 
catalytic base or as a coordinating ligand for a required metal. The most dramatic 
increases in K m for cGMP are caused by site-directed mutagenesis of Tyr-602 and Glu- 
775. These two residues could form part of the cGMP -binding pocket of the catalytic site 
of PDE5. Because some mutations affecting & ca t and K m are juxtaposed in the primary 
sequence, the cGMP-binding pocket and catalytic machinery are likely to involve 
overlapping subdomains within the catalytic domain of PDES. All of the components 
required for catalytic activity of PDE5 are contained within a single monomeric catalytic 
domain. (Furchgott & Vanhoutte, FASEB J. 3:2007-2018 (1997).) 

[0012] Occupation of the allosteric cGMP-binding sites of PDES is required for specific 
phosphorylation of Ser-92 by PKG or PKA, and occupation of the binding sites is also 
associated with an increase in the Stokes radius of the enzyme, implying that a 
conformational change occurs (Francis, et al., Methods, 14:81-92 (1998)). A direct effect 
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of cGMP binding to the allosteric sites on cGMP breakdown at the catalytic site has not 
been demonstrated, although the principle of reciprocity (binding of cGMP at the catalytic 
site stimulates binding at the allosteric sites) dictates that there should be an effect (Weber, 
G., Adv. Protein Chem., 29:1-83 (1975); Francis, et al., Cyclic Nucleotide 
Phosphodiesterases: Structure, Regulation and Drug Action, Beavo, J., and Houslay, 
M.D., eds, 1 17-140, John Wiley & Sons, Inc., New York (1990)). The stimulatory effect 
of cGMP analogs specific for the catalytic site on cGMP binding to the allosteric site(s) of 
PDE5 suggests that interaction of cGMP with the catalytic site precedes cGMP binding to 
the allosteric binding site(s) (Francis, et al., J. Biol. Chem., 255 :620-626 (1980); Thomas, 
et al., J. Biol Chem., 265:14971-14978 (1990)). This implies that upon cGMP elevation 
in cells, cGMP breakdown at the catalytic site would increase because of mass action 
(increased substrate availability). This increased cGMP interaction at the catalytic site 
would enhance cGMP binding at the allosteric sites, thus increasing phosphorylation of the 
enzyme to promote further increases in cGMP breakdown. Although experimental results 
are consistent with such a sequence of events, this pathway has not been proven 
unequivocally in broken cell systems. However, rapid phosphorylation of PDE5, which is 
associated with increased PDE activity, occurs in intact tissues in response to stimulation 
by atrial natriuretic factor and may be caused by PKG action (Wyatt, et al., Am. J. 
Physiol, 274 :H448-H455 (1998)). This process could represent negative-feedback 
regulation of cGMP levels in cells. PKA can also phosphorylate PDE5 in vitro, albeit at 
about 10% the rate at which PKG catalyzes this reaction; whether or not this occurs in vivo 
is uncertain because the concomitant elevation of cGMP and cAMP would be required to 
expose Ser-92 and activate PKA, respectively. Burns et al (Burns, et al., Biochem. J., 
283 :487-491 (1992)) have reported that a partially purified PDE5 from guinea pig lung is 
activated when phosphorylated by PKA. PDE5 may also be regulated by other low 
molecular weight factors, and these could alter the effects of phosphorylation (Lochhead, 
et al., J. Biol Chem., 272:18397-18403 (1997)). As is the case for PDE4, PDE5 may also 
be subject to long term regulation through changes in enzyme concentration in some cell 
types (Sanchez, et al, Pediatr. Res., 43:163-168 (1998); Kotera, et al., Eur. J. Biochem., 
249:434-442 (1997); Bakre, et al., FEBS Lett., 408:345-349 (1997)). 

[0013] The K D of PDE5 for binding cGMP in the allosteric sites is -0.2 /jM (Thomas, et 

al., J. Biol Chem., 265:14964-14970 (1990)). The presence of two kinetically distinct 

allosteric cGMP-binding sites in PDE5 was first suggested by the curvilinear pattern of 
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cGMP dissociation from the enzyme. Studies using site-directed mutagenesis confirm the 
presence of two sites and indicate that the binding of cGMP to each allosteric site could 
involve a NK(JQ rt D motif (McAllister-Lucas, et al., J. Biol Chem., 270:30671-30679 
(1995); Turko, et al., J. Biol. Chem., 271:22240-22244 (1996)), which resembles that used 
by G proteins for binding GTP (Pai, et al, Nature, 341:209-214 (1989)). The conserved 
sequence of the allosteric cyclic nucleotide -binding sites in PDE2, PDE5, PDE6, and 
PDE10 is evolutionary distinct from that of the family containing PKG, PKA, and cation 
channels (McAllister-Lucas, et al., J. Biol Chem., 268:22863-22873 (1993)), indicating 
that the allosteric cGMP-binding sites of these PDEs represent a newly recognized class of 
cyclic nucleotide receptors. Another class may be represented by the catalytic sites of 
PDEs, the sequences of which contain a binding pocket for cyclic nucleotides in the 
catalytic domain in order to optimize the catalytic process. In PDES, classical PDE 
inhibitors and selected cyclic nucleotide analogs compete with cGMP at the catalytic site 
but do not interact with the cGMP-binding allosteric sites (Francis, et al., Cyclic 
Nucleotide Phosphodiesterases: Structure, Regulation and Drug Action, Beavo, J., and 
Houslay, M.D., eds, 1 17-140, John Wiley & Sons, Inc., New York (1990)). The order of 
potency of some common PDE inhibitors for PDES is sildenafil > zaprinast > 
dipyridamole > IBMX > cilostamide > theophylline > caffeine > rolipram (Fig. 3) 
(Thomas, et al., J. Biol Chem.. 265:14964-14970 (1990); Ballard, et al., J. Urol, 
159 :2164-2171 (1998)). Many cyclic nucleotide analogs are also inhibitors of PDE5 
(Francis, et al., Cyclic Nucleotide Phosphodiesterases: Structure, Regulation and Drug 
Action, Beavo, J., and Houslay, M.D., eds, 1 17-140, John Wiley & Sons, Inc., New York 
(1990)), which is to be expected based on the structural similarity of these compounds 
with cGMP. Some IBMX analogs modified at the 8-position, such as 8-(2-chlorobenzyl)- 
IBMX, are more potent inhibitors than are any of the cyclic nucleotide analogs (Sekhar, et 
al., Phosphodiesterase Inhibitors, Schudt, C, Dent, G., and Rabe, K.F., eds, 135-146, 
Academic Press, New York (1996)). Even though the IBMX analogs are generally better 
PDE5 inhibitors than are cyclic nucleotide analogs, many of the latter are more potent for 
relaxing intact vascular smooth muscle. 

[0014] Because the PDE inhibitors show competitive kinetics with respect to cGMP in 

the catalytic site of PDES, they would be expected to form molecular contacts like those 

formed by cGMP. However, results of mutagenesis of PDE5 indicate that, although both 

zaprinast, a potent PDES inhibitor, and cGMP appear to make contact with several of the 
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same amino acids in the catalytic domain, some of the residues that are important for 
interaction with zaprinast, e.g. Asp-754 and Gly-780, are not critical for interaction with 
cGMP (Turko, et al., J. Biol Chem., 273:6460-6466 (1998)). As noted above, Asp-754 is 
crucial for efficient catalysis, which is suggestive that inhibition by zaprinast could be due 
in part to interference with an important function of Asp-754. 

[0015] The PDE5 subfamily has only one member: PDE5A (Corbin and Francis, "Cyclic 
GMP Phosphodiesterase-5: Target of Sildenafil," The Journal of Biological Chemistry, 
274(20):13729-13732 (1999)). PDE5 possesses a preference for cGMP over cAMP as a 
substrate. PDE5 is expressed in smooth muscle tissue (Table 1), importantly in the corpus 
cavernosum. This enzyme possesses two GAF domains in the N-terminal regulatory 
region. These GAF domains act in concert to bind cGMP and mediate dimerization and 
activation PDE activity. A recent crystal structure of the PDE2 GAF domain suggests 
possible mechanisms by which the GAF domains bind cGMP and mediate dimerization 
(Martinez et al., Proc Natl Acad Sci USA 99:13260-13265 (2002)). PDE5 has attracted 
considerable attention as a therapeutic target due to the tremendous commercial success of 
Viagra (Pfizer) (Rotella, 2002, Phosphodiesterase 5 inhibitors: Current status and potential 
applications, Nature Reviews 1 :674-682). In addition to Viagra (sildenafil), two other 
drugs are quite far along in the approval process, namely vardenafil (Bayer) and tadalafil 
(Lilly/ICOS). One apparent drawback to these compounds is some cross-reactivity with 
the closely related PDE families PDE6 and PDE1 1 (Gresser and Gleiter, Eur J Med Res 
7:435-446 (2002)). The availability of PDE5 structural information may enable the 
discovery of PDE5 inhibitors with improved selectivity versus PDE6 and PDE1 1. The 
crystal structure of PDE 5 has not been reported in the literature. 



SUMMARY OF THE INVENTION 

[0016] The present invention concerns structural information about PDE5A, crystals of 
PDE5A with and without binding compounds, and the use of the PDE5 A crystals and 
structural information about the PDE5A to develop PDE5A ligands, which can be 
developed from new chemical classes, or can be developed from previously known 
PDE5A ligands. 
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[0017] Thus, in a first aspect, the invention concerns a method for developing ligands 
binding to a PDE5 A, where the method includes identifying as molecular scaffolds one or 
more compounds that bind to a binding site of PDE5A; determining the orientation of at 
least one molecular scaffold in co-crystals with PDE5A; identifying chemical structures of 
one or more of the molecular scaffolds, that, when modified, alter the binding affinity or 
binding specificity or both between the molecular scaffold and the PDE5A; and 
synthesizing a ligand in which one or more of the chemical structures of the molecular 
scaffold is modified to provide a ligand that binds to the PDE5A with altered binding 
affinity or binding specificity or both. 

[0018] The terms "PDE5A phosphodiesterase" and "PDE5A" mean an enzymatically 
active phosphodiesterase that contains a portion with greater than 90% amino acid 
sequence identity to amino acid residues 531-875 of native PDE5A as shown in Table 4, 
for a maximal alignment over an equal length segment; or that contains a portion with 
greater than 90% amino acid sequence identity to at least 200 contiguous amino acids 
from amino acid residues 531-875 of native PDE5A or the amino acid sequence provided 
in Table 2 that retains binding to natural PDE5A ligand cGMP. Preferably the sequence 
identity is at least 95, 97, 98, 99, or even 100%. Preferably the specified level of sequence 
identity is over a sequence at least 300 contiguous amino acid residues. 

[0019] The term "PDE5A phosphodiesterase domain" refers to a reduced length PDE5A 
(i.e., shorter than a full-length PDE5A by at least 100 amino acids that includes the 
phosphodiesterase catalytic region in PDE5 A. Highly preferably for use in this invention, 
the phosphodiesterase domain retains phosphodiesterase activity, preferably at least 50% 
the level of phosphodiesterase activity as compared to the native PDE5A, more preferably 
at least 60, 70, 80, 90, or 100% of the native activity. 

[0020] As used herein, the terms "ligand" and "modulator" are used equivalently to 
refer to a compound that modulates the activity of a target biomolecule, e.g. , an enzyme 
such as a kinase or phosphodiesterase. Generally a ligand or modulator will be a small 
molecule, where "small molecule refers to a compound with a molecular weight of 1500 
daltons or less, or preferably 1000 daltons or less, 800 daltons or less, or 600 daltons or 
less. Thus, an "improved ligand" is one that possesses better pharmacological and/or 
pharmacokinetic properties than a reference compound, where "better" can be defined by a 
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person for a particular biological system or therapeutic use. In terms of the development 
of ligands from scaffolds, a ligand is a derivative of a scaffold. 

[0021] In the context of binding compounds, molecular scaffolds, and ligands, the term 
"derivative" or "derivative compound" refers to a compound having a chemical structure 
that contains a common core chemical structure as a parent or reference compound, but 
differs by having at least one structural difference, e.g., by having one or more substituents 
added and/or removed and/or substituted, and/or by having one or more atoms substituted 
with different atoms. Unless clearly indicated to the contrary, the term "derivative" does 
not mean that the derivative is synthesized using the parent compound as a starting 
material or as an intermediate, although in some cases, the derivative may be synthesized 
from the parent. 

[0022] Thus, the term "parent compound" refers to a reference compound for another 
compound, having structural features continued in the derivative compound. Often but not 
always, a parent compound has a simpler chemical structure than the derivative. 

[0023] By "chemical structure" or "chemical substructure" is meant any definable atom 
or group of atoms that constitute a part of a molecule. Normally, chemical substructures 
of a scaffold or ligand can have a role in binding of the scaffold or ligand to a target 
molecule, or can influence the three-dimensional shape, electrostatic charge, and/or 
conformational properties of the scaffold or ligand. 

[0024] The term "binds" in connection with the interaction between a target and a 
potential binding compound indicates that the potential binding compound associates with 
the target to a statistically significant degree as compared to association with proteins 
generally {i.e., non-specific binding). Thus, the term "binding compound" refers to a 
compound that has a statistically significant association with a target molecule. Preferably 
a binding compound interacts with a specified target with a dissociation constant (k<j) of 1 
mM or less. A binding compound can bind with "low affinity", "very low affinity", 
"extremely low affinity", "moderate affinity", "moderately high affinity", or "high 
affinity" as described herein. 

[0025] In the context of compounds binding to a target, the term "greater affinity" 
indicates that the compound binds more tightly than a reference compound, or than the 
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same compound in a reference condition, i.e., with a lower dissociation constant. In 
particular embodiments, the greater affinity is at least 2, 3, 4, 5, 8, 10, 50, 100, 200, 400, 
500, 1000, or 10,000-fold greater affinity. 

[0026] Also in the context of compounds binding to a biomolecular target, the term 
"greater specificity" indicates that a compound binds to a specified target to a greater 
extent than to another biomolecule or biomolecules that may be present under relevant 
binding conditions, where binding to such other biomolecules produces a different 
biological activity than binding to the specified target. Typically, the specificity is with 
reference to a limited set of other biomolecules, e.g., in the case of PDE5A, other 
phosphodiesterases {e.g., PDE1, PDE6, and/or PDE1 1) or even other type of enzymes. In 
particular embodiments, the greater specificity is at least 2, 3, 4, 5, 8, 10, 50, 100, 200, 
400, 500, or 1000-fold greater specificity. 

[0027] As used in connection with binding of a compound with PDE5A, the term 
"interact" indicates that the distance from a bound compound to a particular amino acid 
residue will be 5.0 angstroms or less. In particular embodiments, the distance from the 
compound to the particular amino acid residue is 4.5 angstroms or less, 4.0 angstroms or 
less, or 3.5 angstroms or less. Such distances can be determined, for example, using co- 
crystallography, or estimated using computer fitting of a compound in a PDE5A active 
site. 

[0028] For reference to particular amino acid residues in PDE5A, polypeptide residue 
number is defined by the numbering provided in Yanaka et al., 1998, Eur. J. Biochem. 
255:391-399. 

[0029] In a related aspect, the invention provides a method for developing ligands 
specific for PDE5A, where the method involves determining whether a derivative of a 
compound that binds to a plurality of phosphodiesterases (e.g., a molecular scaffold) has 
greater specificity for the PDE5A phosphodiesterase than the parent compound with 
respect to other phosphodiesterases. 

[0030] As used herein in connection with binding compounds or ligands, the term 
"specific for PDE5A phosphodiesterase", "specific for PDE5A" and terms of like import 
mean that a particular compound binds to PDE5 A to a statistically greater extent than to 
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other phosphodiesterases that may be present in a particular organism. Also, where 
biological activity other than binding is indicated, the term "specific for PDE5 A" indicates 
that a particular compound has greater biological activity associated with binding PDE5A 
than to other phosphodiesterases. Preferably, the specificity is also with respect to other 
biomolecules (not limited to phosphodiesterases) that may be present from an organism. 

[003 1J In another aspect, the invention provides a method for obtaining improved 
ligands binding to PDE5 A, where the method involves identifying a compound that binds 
to PDE5A, determining whether that compound interacts with one or more conserved 
PDE5 A active site residues, and determining whether a derivative of that compound binds 
to the PDE5 A with greater affinity or greater specificity or both than the parent binding 
compound. Binding with greater affinity or greater specificity or both than the parent 
compound indicates that the derivative is an improved ligand. This process can also be 
carried out in successive rounds of selection and derivatization and/or with multiple parent 
compounds to provide a compound or compounds with improved ligand characteristics. 
Likewise, the derivative compounds can be tested and selected to give high selectivity for 
PDE5 A, or to give cross-reactivity to a particular set of targets, for example to a subset of 
phosphodiesterases that includes PDE5A. In particular embodiments, known PDE5A 
inhibitors can be used, and derivatives with greater affinity and/or greater specificity can 
be developed, preferably using PDE5A structure information; greater specificity for 
PDE5A relative to PDE1, PDE6, and/or PDE1 1 is developed. 

[0032] By "molecular scaffold" or "scaffold" is meant a simple target binding molecule 
to which one or more additional chemical moieties can be covalently attached, modified, 
or eliminated to form a plurality of molecules with common structural elements. The 
moieties can include, but are not limited to, a halogen atom, a hydroxyl group, a methyl 
group, a nitro group, a carboxyl group, or any other type of molecular group including, but 
not limited to, those recited in this application. Molecular scaffolds bind to at least one 
target molecule, preferably to a plurality of molecules in a protein family, and the target 
molecule can preferably be a enzyme, receptor, or other protein. Preferred characteristics 
of a scaffold can include binding at a target molecule binding site such that one or more 
substituents on the scaffold are situated in binding pockets in the target molecule binding 
site; having chemically tractable structures that can be chemically modified, particularly 
by synthetic reactions, so that a combinatorial library can be easily constructed; having 
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chemical positions where moieties can be attached that do not interfere with binding of the 
scaffold to a protein binding site, such that the scaffold or library members can be 
modified to form ligands, to achieve additional desirable characteristics, e.g. , enabling the 
ligand to be actively transported into cells and/or to specific organs, or enabling the ligand 
to be attached to a chromatography column for additional analysis. Thus, a molecular 
scaffold is an identified target binding molecule prior to modification to improve binding 
affinity and/or specificity, or other pharmacalogic properties. 

[0033] The term "scaffold core" refers to the core structure of a molecular scaffold onto 
which various substituents can be attached. Thus, for a number of scaffold molecules of a 
particular chemical class, the scaffold core is common to all the scaffold molecules. In 
many cases, the scaffold core will consist of or include one or more ring structures. 

[0034] By "binding site" is meant an area of a target molecule to which a ligand can 
bind non-covalently. Binding sites embody particular shapes and often contain multiple 
binding pockets present within the binding site. The particular shapes are often conserved 
within a class of molecules, such as a molecular family. Binding sites within a class also 
can contain conserved structures such as, for example, chemical moieties, the presence of 
a binding pocket, and/or an electrostatic charge at the binding site or some portion of the 
binding site, all of which can influence the shape of the binding site. 

[0035] By "binding pocket" is meant a specific volume within a binding site. A binding 
pocket can often be a particular shape, indentation, or cavity in the binding site. Binding 
pockets can contain particular chemical groups or structures that are important in the non- 
covalent binding of another molecule such as, for example, groups that contribute to ionic, 
hydrogen bonding, or van der Waals interactions between the molecules. 

[0036] By "orientation", in reference to a binding compound bound to a target molecule 
is meant the spatial relationship of the binding compound (which can be defined by 
reference to at least some of its constituent atoms) to the binding pocket and/or atoms of 
the target molecule at least partially defining the binding pocket. 

[0037] In the context of target molecules in this invention, the term "crystal" refers to a 
regular assemblage of a target molecule of a type suitable for X-ray crystallography. That 
is, the assemblage produces an X-ray diffraction pattern when illuminated with a beam of 
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X-rays. Thus, a crystal is distinguished from an aggolmeration or other complex of target 
molecule that does not give a diffraction pattern. 

[0038] By "co-crystal" is meant a complex of the compound, molecular scaffold, or 
ligand bound non-covalently to the target molecule and present in a crystal form 
appropriate for analysis by X-ray or protein crystallography. In preferred embodiments 
the target molecule-ligand complex can be a protein-ligand complex. 

[0039] The phrase "alter the binding affinity or binding specificity" refers to changing 
the binding constant of a first compound for another, or changing the level of binding of a 
first compound for a second compound as compared to the level of binding of the first 
compound for third compounds, respectively. For example, the binding specificity of a 
compound for a particular protein is increased if the relative level of binding to that 
particular protein is increased as compared to binding of the compound to unrelated 
proteins. 

[0040] As used herein in connection with test compounds, binding compounds, and 
modulators (ligands), the term "synthesizing" and like terms means chemical synthesis 
from one or more precursor materials. 

[0041] The phrase "chemical structure of the molecular scaffold is modified" means that 
a derivative molecule has a chemical structure that differs from that of the molecular 
scaffold but still contains common core chemical structural features. The phrase does not 
necessarily mean that the molecular scaffold is used as a precursor in the synthesis of the 
derivative. 

[0042] By "assaying" is meant the creation of experimental conditions and the gathering 
of data regarding a particular result of the experimental conditions. For example, enzymes 
can be assayed based on their ability to act upon a detectable substrate. A compound or 
ligand can be assayed based on its ability to bind to a particular target molecule or 
molecules. 

[0043] By a "set" of compounds is meant a collection of compounds. The compounds 
may or may not be structurally related. 



1-6 



Atty. Dkt. No.: 039363-1106 



[0044] In another aspect, structural information about PDE5A can also be used to assist 
in determining a struture for another phosphodiesterase, e.g., a PDE2, by creating a 
homology model from an electronic representation of a PDE5A structure. 

[0045] Typically creating such a homology model involves identifying conserved amino 
acid residues between PDE5 A and the other phosphodiesterase of interest; transferring the 
atomic coordinates of a plurality of conserved amino acids in the PDE5A structure to the 
corresponding amino acids of the other phosphodiesterase to provide a rough structure of 
that phosphodiesterase; and constructing structures representing the remainder of the other 
phosphodiesterase using electronic representations of the structures of the remaining 
amino acid residues in the other phosphodiesterase. In particular, coordinates from Table 
1 for conserved residues can be used. Conserved residues in a binding site can be used. 

[0046] To assist in developing other portions of the phosphodiesterase structure, the 
homology model can also utilize, or be fitted with, low resolution x-ray diffraction data 
from one or more crystals of the phosphodiesterase, e.g., to assist in linking conserved 
residues and/or to better specify coordinates for terminal portions of a polypeptide. 

[0047] The PDE5 A structural information used can be for a variety of different PDE5A 
variants, including full-length wild type, naturally-occurring variants (e.g., allelic variants 
and splice variants), truncated variants of wild type or naturally-occuring variants, and 
mutants of full-length or truncated wild-type or naturally-occurring variants (that can be 
mutated at one or more sites). For example, in order to provide a PDE5A structure closer 
to a variety of other phosphodiesterase structures, a mutated PDE5 A that includes a 
mutation to a conserved residue in a binding site can be used. 

[0048] In another aspect, the invention provides a crystalline form of PDE5A, which 
may be a reduced length PDE5A such as a PDE5A phosphodiesterase domain, e.g., having 
atomic coordinates as described in Table 1. The crystalline form can contain one or more 
heavy metal atoms, for example, atoms useful for X-ray crystallography. The crystalline 
form can also include a binding compound in a co-crystal, e.g., a binding compound that 
interacts with one more more conserved PDE5A active site residues, or any two, any three, 
any four, any five, any six of those residues, and can, for example, be a known PDE5A 
inhibitor. PDE5A crystals can be in various environments, e.g., in a crystallography plate, 
mounted for X-ray crystallography, and/or in an X-ray beam. The PDE5 A may be of 
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various forms, e.g., a wild-type, variant, truncated, and/or mutated form as described 
herein. 

[0049] The invention further concerns co-crystals of PDE5A, which may be a reduced 
length PDE5A, e.g., a PDE5A phosphodiesterase domain, and a PDE5A binding 
compound. Advantageously, such co-crystals are of sufficient size and quality to allow 
structural determination of PDE5A to at least 3 Angstroms, 2.5 Angstroms, 2.0 
Angstroms, or 1.8 Angstroms. The co-crystals can, for example, be in a crystallography 
plate, be mounted for X-ray crystallography and/or in an X-ray beam. Such co-crystals 
are beneficial, for example, for obtaining structural information concerning interaction 
between PDE5A and binding compounds. 

[0050] PDE5A binding compounds can include compounds that interact with at least one 
of conserved PDE5A active site residues, or any 2, 3, 4, 5, or 6 of those residues. 
Exemplary compounds that bind to PDE5A include compounds described in references 
cited herein. 

[0051] Likewise, in additional aspects, methods for obtaining PDE5 A crystals and co- 
crystals are provided. In one aspect is provided a method for obtaining a crystal of 
PDE5A phosphodiesterase domain, by subjecting PDE5A phosphodiesterase domain 
protein at 5-20 mg/ml, preferably 8-12 mg/ml, to crystallization condition substantially 
equivalent to: 10% (w/v) PEG3000, 100 mM phosphate-citrate (pH 4.3), 200 mM NaCl, 
ImM DTT, ImM Sp-cAMP. In general, the PDE5A will be in a solution containing the 
protein and suitable buffer. 

[0052] Crystallization conditions can be initially identified using a screening kit, such as 
a Hampton Research (Riverside, C A) screening kit 1 . Conditions resulting in crystals can 
be selected and crystallization conditions optimized based on the demonstrated 
crystallization conditions. To assist in subsequent crystallography, the PDE5A can be 
seleno-methionine labeled. Also, as indicated above, the PDE5A may be any of various 
forms, e.g., truncated to provide a PDE5A phosphodiesterase domain, which can be 
selected to be of various lengths. 

[0053] A related aspect provides a method for obtaining co-crystals of PDE5A, which 
can be a reduced length PDE5A, with a binding compound, by subjecting PDE5A protein 
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at 5-20 mg/ml to crystallization conditions substantially equivalent to 10% (w/v) PEG3000, 
100 mM phosphate-citrate (pH 4.3), 200 mM NaCl, ImM DTT, ImM Sp-cAMP, in the 
presence of binding compound, for a time sufficient for cystal development. The binding 
compound may be added at various concentrations depending on the nature of the 
comound, e.g., final concentration of 0.5 to 1.0 mM. In many cases, the binding 
compound will be in an organic solvent such as demethyl sulfoxide solution (DMSO). 
While not preferred, binding compound can also be soaked into a PDE5A crystal, e.g., 
using conventional techniques. 

[0054] In another aspect, provision of compounds active on PDE5A also provides a 
method for modulating PDE5A activity by contacting PDE5A with a compound that binds 
to PDE5A and interacts with one more conserved PDE5A active site residues, where the 
compound has been identified using a PDE5 A crystal structure. The compound is 
preferably provided at a level sufficient to modulate the activity of PDE5A by at least 
10%, more preferably at least 20%, 30%, 40%, or 50%. In many embodiments, the 
compound will be at a concentration of about 1 |iM, 100 jaM, or 1 mM, or in a range of 1- 
100 nM, 100-500 nM, 500-1000 nM, 1-100 jaM, 100-500 jaM, or 500-1000 ^M. 

[0055] As used herein, the term "modulating" or "modulate" refers to an effect of 
altering a biological activity, especially a biological activity associated with a particular 
biomolecule such as PDE5 A. For example, an agonist or antagonist of a particular 
biomolecule modulates the activity of that biomolecule, e.g., an enzyme. 

[0056] The term "PDE5A activity" refers to a biological activity of PDE5A, particularly 
including phosphodiesterase activity. 

[0057] In the context of the use, testing, or screening of compounds that are or may be 
modulators, the term "contacting" means that the compound(s) are caused to be in 
sufficient proximity to a particular molecule, complex, cell, tissue, organism, or other 
specified material that potential binding interactions and/or chemical reaction between the 
compound and other specified material can occur. 

[0058] In a related aspect, the invention provides a method for treating a patient 
suffering from a disease or condition characterized by abnormal PDE5A 
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phosphodiesterase activity, where the method involves administering to the patient a 
compound identified by fitting to a PDE5A crystal structure. 

[0059] Specific diseases or disorders which might be treated or prevented include those 
described in the Detailed Description herein, and in the references cited therein. 

[0060] As crystals of PDE5A have been developed and analyzed, another aspect 
concerns an electronic representation of PDE5A (which may be a reduced length PDE5A), 
for example, an electronic representation containing atomic coordinate representations 
corresponding to the coordinates listed for PDE5A in Table 1, or a schematic 
representation such as one showing secondary structure and/or chain folding, and may also 
show conserved active site residues. The PDE5A may be wild type, an allelic variant, a 
mutant form, or a modifed form, e.g., as described herein. 

[0061] The electronic representation can also be modified by replacing electronic 
representations of particular residues with electronic representations of other residues. 
Thus, for example, an electronic representation containing atomic coordinate 
representations corresponding to the coordinates for PDE5 A listed in Table 1 can be 
modified by the replacement of coordinates for a particular conserved residue in a binding 
site by a different amino acid. Likewise, a PDE5 A representation can be modified by the 
respective substitutions, insertions, and/or deletions of amino acid residues to provide a 
representation of a structure for PDE6 or PDE1 1. Following a modification or 
modifications, the representation of the overall structure can be adjusted to allow for the 
known interactions that would be affected by the modification or modifications. In most 
cases, a modification involving more than one residue will be performed in an iterative 
manner. 

[0062] In addition, an electronic representation of a PDE5 A binding compound or a test 
compound in the binding site can be included, e.g., a non-hydrolyzable cGMP analog. 

[0063] Likewise, in a related aspect, the invention concerns an electronic representation 
of a portion of PDE5A, a binding site (which can be an active site) or phosphodiesterase 
domain, for example, residues 531-875 or other phosphodiesterase domain described 
herein, such as the amino acid sequence provided in Table 2. A binding site or 
phosphodiesterase domain can be represented in various ways, e.g., as representations of 
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atomic coordinates of residues around the binding site and/or as a binding site surface 
contour, and can include representations of the binding character of particular residues at 
the binding site, e.g., conserved residues. As for electronic representations of PDE5A, a 
binding compound or test compound may be present in the binding site; the binding site 
may be of a wild type, variant, mutant form, or modified form of PDE5A. 

[0064] In yet another aspect, the structural information of PDE5A can be used in a 
homology model (based on PDE5 A) for another phosphodiesterase (such as PDE6 or 
PDE1 1), thus providing an electronic representation of a PDE5A based homology model 
for a phosphodiesterase. For example, the homology model can utilize atomic coordinates 
from Table 1 for conserved amino acid residues. In particular embodiments; atomic 
coordinates for a wild type, variant, modified form, or mutated form of PDE5A can be 
used, including, for example, wild type, variants, modified forms, and mutant forms as 
described herein. In particular, PDE5 A structure provides a very close homology model 
for PDE6 and PDE1 1 . Thus, in particular embodiments the invention provides PDE5A- 
based homology models of PDE6 and PDE1 1. 

[0065] In still another aspect, the invention provides an electronic representation of a 
modified PDE5A crystal structure, that includes an electronic representation of the atomic 
coordinates of a modified PDE5A. In an exemplary embodiment, atomic coordinates of 
Table 1 can be modified by the replacement of atomic coordinates for a conserved residue 
with atomic coordinates for a different amino acid. Modifications can include 
substitutions, deletions (e.g., C-terminal and/or N-terminal delections), insertions (internal, 
C-terminal, and/or N-terminal) and/or side chain modifications. 

[0066] In another aspect, the PDE5A structural information provides a method for 
developing useful biological agents based on PDE5A, by analyzing a PDE5 A structure to 
identify at least one sub-structure for forming the biological agent. Such sub-structures 
can include epitopes for antibody formation, and the method includes developing 
antibodies against the epitopes, e.g., by injecting an epitope presenting composition in a 
mammal such as a rabbit, guinea pig, pig, goat, or horse. The sub-structure can also 
include a mutation site at which mutation is expected to or is known to alter the activity of 
the PDE5 A, and the method includes creating a mutation at that site. Still further, the sub- 
structure can include an attachment point for attaching a separate moiety, for example, a 
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peptide, a polypeptide, a solid phase material (e.g., beads, gels, chromatographic media, 
slides, chips, plates, and well surfaces), a linker, and a label {e.g., a direct label such as a 
fluorophore or an indirect label, such as biotin or other member of a specific binding pair). 
The method can include attaching the separate moiety. 

[0067] In another aspect, the invention provides a method for identifying potential 
PDE5A, binding compounds by fitting at least one electronic representation of a 
compound in an electronic representation of a PDE5A binding site. The representation of 
the binding site may be part of an electronic representation of a larger portion(s) or all of a 
PDE5A molecule or may be a representation of only the binding site or active site. The 
electronic representation may be as described above or otherwise described herein. For 
example, the compound may be a molecular scaffold, a derivative of a molecular scaffold, 
or a compound that is structurally similar to such molecular scaffold or derivative thereof. 

[0068] In particular embodiments, the method involves fitting a computer representation 
of a compound from a computer database with a computer representation of the active site 
of PDE5A, and involves removing a computer representation of a compound complexed 
with the PDE5 A molecule and identifying compounds that best fit the active site based on 
favorable geometric fit and energetically favorable complementary interactions as 
potential binding compounds. In particular embodiments, the compound is a known 
PDE5A inhibitor, e.g., as described in a reference cited herein, or a derivative thereof. 

[0069] In other embodiments, the method involves modifying a computer representation 
of a compound complexed with a PDE5A molecule, by the deletion or addition or both of 
one or more chemical groups; fitting a computer representation of a compound from a 
computer database with a computer representation of the active site of the PDE5A 
molecule; and identifying compounds that best fit the active site based on favorable 
geometric fit and energetically favorable complementary interactions as potential binding 
compounds. 

[0070] In still other embodiments, the method involves removing a computer 
representation of a compound complexed with PDE5A, and searching a database for 
compounds having structural similarity to the complexed compound using a compound 
searching computer program or replacing portions of the complexed compound with 
similar chemical structures using a compound construction computer program. 
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[0071] Fitting a compound can include determining whether a compound will interact 
with one or more conserved PDE5A active site residues. Compounds selected for fitting 
or that are complexed with PDE5 A can, for example, be a known PDE5 A inhibitor 
compound. 

[0072] In another aspect, the invention concerns a method for attaching a PDE5A 
binding compound to an attachment component, as well as a method for indentifying 
attachment sites on a PDE5A binding compound. The method involves identifying 
energetically allowed sites for attachment of an attachment component for the binding 
compound bound to a binding site of PDE5A; and attaching the compound or a derivative 
thereof to the attachment component at the energetically allowed site. 

[0073] Attachment components can include, for example, linkers (including traceless 
linkers) for attachment to a solid phase or to another molecule or other moiety. Such 
attachment can be formed by synthesizing the compound or derivative on the linker 
attached to a solid phase medium e.g., in a combinatorial synthesis in a plurality of 
compound. Likewise, the attachment to a solid phase medium can provide an affinity 
medium (e.g., for affinity chromatography). 

[0074] The attachment component can also include a label, which can be a directly 
detectable label such as a fluorophore, or an indirectly detectable such as a member of a 
specific binding pair, e.g., biotin. 

[0075] The ability to identify energetically allowed sites on a PDE5A binding 
compound, also, in a related aspect, provides modified binding compounds that have 
linkers attached, preferably at an energetically allowed site for binding of the modified 
compound to PDE5A. The linker can be attached to an attachment component as 
described above. 

[0076] Another aspect concerns a modified PDE5A polypeptide that includes a 
modification that makes the modified PDE5A more similar than native PDE5 A to another 
phosphodiesterase, and can also include other mutations or other modifications. In various 
embodiments, the polypeptide includes a full-length PDE5A polypeptide, includes a 
modified PDE5A binding site, includes at least 20, 30, 40, 50, 60, 70, or 80 contiguous 
amino acid residues derived from PDE5A including a conserved site. 
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[0077] Still another aspect of the invention concerns a method for developing a ligand 
for a phosphodiesterase that includes conserved residues matching any one, 2, 3, 4, 5, or 6 
of conserved PDE5A active site residues, by determining whether a compound binds to 
the phosphodiesterase and interacts with such active site residues in a PDE5 A crystal. The 
method can also include determining whether the compound modulates the activity of the 
phosphodiesterase. Preferably the phosphodiesterase has at least 50, 55, 60, or 70% 
identity over an equal length phosphodiesterase domain segment. 

[0078] In particular embodiments, the determining includes computer fitting the 
compound in a binding site of the phosphodiesterase and/or the method includes forming a 
co-crystal of the phosphodiesterase and the compound. Such co-crystals can be used for 
determing the binding orientation of the compound with the phosphodiesterase and/or 
provide structural information on the phosphodiesterase, e.g., on the binding site and 
interacting amino acid residues. Such binding orientation and/or other structural 
information can be accomplished using X-ray crystallography. 

[0079] The invention also provides compounds that bind to and/or modulate (e.g., 
inhibit) PDE5A, e.g., PDE5A phosphodiesterase activity. Accordingly, in aspects and 
embodiments involving PDE5A binding compounds, molecular scaffolds, and ligands or 
modulators, the compound is a weak binding compound; a moderate binding compound; a 
strong binding compound; the compound interacts with one or more conserved PDE5 A 
active site residues; the compound is a small molecule; the compound binds to a plurality 
of different phosphodiesterases (e.g., at least 2, 3, 4, 5, 7, 10, or more different 
phosphodiesterases). 

[0080] Additional aspects and embodiments will be apparent from the following 
Detailed Description and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0081] FIGURE 1 shows a ribbon diagram schematic representation of PDE5A 
phosphodiesterase domain having the sequence in Table 2. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0082] The Tables will first be briefly described. 

[0083] Table 1 provides atomic coordinates for human PDE5 A phosphodiesterase 
domain. In this table, the various columns have the following content, beginning with the 
left-most column: 

ATOM: Refers to the relevant moeity for the table row. 

Atom number: Refers to the arbitrary atom number designation within the coordinate 
table. 

Atom Name: Identifier for the atom present at the particular coordinates. 

Chain ID: Chain ID refers to one monomer of the protein in the crystal, e.g., chain "A", or 

to other compound present in the crystal, e.g., HOH for water, and L for a ligand or 

binding compound. Multiple copies of the protein monomers will have different chain Ids. 

Residue Number: The amino acid residue number in the chain. 

X, Y, Z: Respectively are the X, Y, and Z coordinate values. 

Occupancy: Describes the fraction of time the atom is observed in the crystal. For 
example, occupancy = 1 means that the atom is present all the time; occupancy = o.5 
indicates that the atom is present in the location 50% of the time. 
B-factor: A measure of the thermal motion of the atom. 
Element: Identifier for the element. 

[0084] Table 2 provides amino acid and nucleic acid sequences for a PDE5A 
phosphodiesterase domain. Numbering on the amino acid sequence does not correspond 
to standard numbering for native PDE5A. 

[0085] Table 3 provides an alignment of phosphodiesterase domains for several 
phosphodiesterases, including human PDE5 A, providing identification of residues 
conserved between various members of the set. 

[0086] Table 4 provides the nucleic acid and amino acid sequences for human PDE5A 
phosphodiesterase. 

I. General and PDE5 Inhibitors 
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[0087] The present invention concerns the use of PDE5A phosphodiesterase structures, 
structural information, and related compositions for identifying compounds that modulate 
PDE5A phosphodiesterase activity and for determining structuctures of other 
phosphodiesterases. 

[0088] PDE5 A is involved in a number of disease and conditions, and thus can be 
targeted in therapeutic and prophylactic methods. 

[0089] A large number of compounds that are active on PDE5, from several different 
chemical classes, have been identified, and pharmaceutical products directed to PDE5 
have been developed and approved by the Food and Drug Administration. Such 
compounds can be used in conjuction with crystal structure information on PDE5A to 
develop improved inhibitors. 

[0090] The following are among the examples of descriptions of such compounds. The 
compounds described in the publications listed can be used in the present invention to 
develop improved PDE5 inhibitors, e.g., inhibitors with improved affinity, activity, and/or 
specificity properties. Bunnage et al., U.S. Patent 6,333,330, U.S. Patent 6,407,1 14, and 
U.S. Patent Publication 2001/0039271, all entitled PYRAZOLOPYRIMIDINONE CGMP 
PDE5 INHIBITORS FOR THE TREATEMENT OF SEXUAL DYSFUNCTION, 
describe some pyrazolopyrimidinone compounds and their synthesis, preparation of 
pharmaceutical compositions, and administration. Fryburg et al., U.S. Patent Application 
Publication 2002/0165237, entitled TREATMENT OF THE INSULIN RESISTANCE 
SYNDROME, lists a variety of PDE5 inhibitors, including compounds described in EP-A- 
0463756, EP-A-0526004, WO 93/06104, 93/07149, WO 93/12095. WO 94/00453, WO 
98/49166, WO 99/54333, EP-A-0995751, WO 00/24745, EP-A-995750, WO 95/19978, 
and WO 93/07124, along with methods for formulating and administering pharmaceutical 
compositions. Bombrun, U.S. Patent 6,043,252, entitled CARBOLINE DERIVATIVES, 
describes PDE5 inhibitors that are carboline derivatives. Allerton, U.S. Patent Application 
Publication 2002/0173502, entitled PHARMACEUTIC ALLY ACTIVE COMPOUNDS, 
describes as PDE5 inhibitors certain compounds that include four heterocyclic groups. 
Sperl et al., U.S. Patent 6,066,634 describes substituted condensation products of N- 
benzyl-3-indenylacetamides herocyclic aldehydes and their use in treatment of neoplasias. 
Additional PDE5 inhibitors are described in Maw, U.S. Patent 6,503,908; Maw et al., U.S. 
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Patent 6,440,982; Daugan et al., U.S. Patent 6,143,757; Daugan et al., U.S. Patent 
6,143,746; Gonzalez et al., U.S. Patent Application Publication 2002/0058606. 
Benzimidazole derivatives with PDE5 inhibitor activity, and their preparation and use are 
described in Yamasaki et al., U.S. Patent 6,166,219. All of the above references are 
incorporated herein by reference in their entireties. 

Exemplary Diseases Associated with PDE5A. 

[0091] PDE5A has been correlated with several conditions in which inhibition of 
PDE5A is useful. Best known is the involvement of PDE5A in treatment of erectile 
dysfunction. Erection is largely a haemodynamic event that is regulated by fascular tone 
and blood-flow balance in the penis. Because cGMP levels modulate vascular tone, 
PDE5A is a useful target for intervention. When a man is sexually stimulated, nitric oxide 
(NO) is released from non-cholinergic, non-adrenergic neurons in the penis as well as 
from endothelial cells. NO diffuses into cells, where it activates soluble guanylyl cyclase, 
the enzyme that converts GTP to cGMP. The cGMP then stimulates PKG, which initiates 
a protein phosphorylation cascade. This results in a descrease in intracellular levels of 
cancium oins, leading ultimately to dilation of the arteries that bring blood to the penis and 
compression o the spongy corpus-cavernosum tissue. This compression contracts veins, 
which reduces the outflow of blood and increases intracavernosal pressure resulting in an 
erection. A PDE5A inhibitor retards enzymatic hydrolysis of cGMP in the corpus 
cavernosum, leading to the same outcome. (Rotella, 2002, Phosphodiesterase 5 inhibitors: 
Current status and potential applications, Nature Reviews 1 :674-682.) (See also, Taher et 
al, J, Urol 149:285A (1993); Murray, DN&P 6(3):150-156 (1993); Emmick et al., U.S. 
Patent 6,451,807, entitled METHODS OF TREATING SEXUAL DYSFUNCTION IN 
AN INDIVIDUAL SUFFEREING FROM A RETINAL DISEASE, CLASS 1 
CONGESTIVE HEART FAILURE, OR MYOCARDIAL INFARCTION USING A 
PDE5 INHBITOR.) 

[0092] In addition to treating erectile dysfunction, PDE inhibitors are described for use 
in treatment of premature ejaculation in individuals with normal erectile function. Boolell, 
U.S. Patent Application Publication 2002/0091 129. 



[0093] The use of PDE5A inhibitors in treatment of cystic fibrosis has also been 
indicated. 
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[0094] Treatment of Parkinson's Disease (PD) using PDE5 inhibitors has also been 
indicated. For example, Roylance, U.S. Patent 6,492,371, indicates that PDE5 inhibitors 
are useful in methods for preventing and/or slowing the progression of PD or reducing or 
eliminating clinical symptoms of PD. 

[0095] Watkins et aL, U.S. Patent Application Publication 2002/0128171 describes the 
use of PDE5 inhibitors to treat gastrointestinal disorders, such as disorders characterized 
by hypomobility or hypermobility of small intesting, large intestine, colon, esphagus, or 
stomach. 

[0096] The vasodilatory effects of PDE5A inhibitors allows their use in connection with 
some circulatory'disorders. In conjunction with a prostaglandin analogue (e.g., iloprost), a 
PDE5A inhibitor can enhance reduction of pulmonary arterial pressure, allowing such use 
in patients with pulmonary hypertension. 

[0097] Subarachnoid haemorrhage is a significant cause of stroke in many patients. It 
often occurs as a consequence of reduced responsiveness to NO in cerebral srteries. To 
counter this effect, PDE5 A inhibitors can elevate cellular levels of cGMP in cerbral 
arteries, thereby at least partially correcting the vascular dysfunction. 

[0098] Shahinpoor et al., U.S. Patent Application Publication 2002/0168424 describes 
the use of PDE5 inhibitors in conjunction with a nitric oxid donor for treatment of 
glaucoma. The publication indicates the drugs work synergistically to reduce intraocular 
pressure. 

[0099] PDE5A inhibitors also moderate platelet aggregation in a dose-dependent 
manner. 

[0100] Fryburg et al., U.S. Patent Application Publication 2002/0165,237, entitled 
TREATEMENT OF THE INSULIN RESISTANCE SYNDROME, describes the use of 
selective PDE5 inhibitors in the curative, palliative, or prophylactic treatment of insulin 
resistance syndrome (also referred to as Syndrome X and Metabolic Syndrome). Insulin 
resistance syndrome means the concomitant existence of two or more of: dyslipidemia, 
hypertension, type 2 diabetes mellitus or a family history of type 2 diabetes mellitus, 
hyperuricaemia, and/or gout, a pro-coagulant state, atheroslerosis, truncal obesity. 
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[0101] Thompson et al., U.S. Patent 6 5 130 5 053, entitled METHODS FOR SELECTING 
COMPOUNDS FOR INHIBITON OF NEOPLASTIC LESIONS, and Thompson et al., 
U.S. Patent Application Publication 2002/0009764, entitled METHODS FOR 
IDENTIFYING COMPOUNDS FOR INHIBITON OF NEOPLASTIC LESIONS, AND 
PHARMACEUTICAL COMPOSITIONS CONTAINING SUCH COMPOUNDS 
describes the use of PDE5 inhibitors in conjunction with inhibition of PDE2 activity, 
leading to cell apoptosis, and methods for identifying useful compounds. See also, 
Pamakcu et al., U.S. Patent 6,500,610, entitled METHODS FOR IDENTIFYING 
COMPOUNDS FOR INHIBITING NEOPLASTIC LESIONS, AND 
PHARMACEUTICAL COMPOSITIONS CONTAINING SUCH COMPOUNDS. 
Similarly, Whitehead, U.S. Patent 6,479,493 describes the use of PDE2 inhibition 
combined with PDE5 inhibition for treatment of Type 1 diabetes, and describes compouns 
fo that purpose. Use of combination PDE2 and PDE5 inhibition is also described in Earle 
et al., U.S. Patent 6,465,494, entitled METHODS FOR TREATMENT OF CYSTIC 
FIBROSIS. 

[0102] Bombrun, U.S. Patent 6,043,252 indicates that PDE5 inhibitors are useful for 
treatment of stable, unstable, and variant (Prinzmetal) angina, hypertension, pulmonary 
hypertenion, chronic obstructive pulmonary disease, congestive heart failure, acute 
respiratory distress syndrome, acute and chronic renal failure, atherosclerosis, conditions 
of reduced blood vessel patency (e.g., post-PTCA or post-bypass graft stenosis), peripheral 
vascular disease, vascular disorders such as Raynaud's disease, myocardial infarction, 
prophylaxis of stroke, stroke, bronchitis, chronic asthma, allergic asthma, allergic rhinitis, 
hypertrophy, , male and female erictile dysfunction, and diseases characterized by 
disorders of gut motility. 

[0103] Davies et al., U.S. Patent Application Publication 2002/0065286 describes the 
use of PDE5 inhibitors in wound treatment, such chronic wounds of non-diabetic origin, as 
well as acute wounds, such as in the elderly. 

[0104] The present methods can be used for developing ligands for treating one or more 
of the diseases and conditions above, or for other diseases or conditions for which PDE5 A 
modulation is found useful. 

II. Crystalline PDE5A 
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[0105] Crystalline PDE5A (e.g., human PDE5A) include native crystals, 
phosphodiesterase domain crystals, derivative crystals and co-crystals. The native crystals 
generally comprise substantially pure polypeptides corresponding to PDE5A in crystalline 
form. PDE5A phosphodiesterase domain crystals generally comprise substantially pure 
PDE5A phosphodiesterase domain in crystalline form. In connection with the 
development of inhibitors of PDE5A phosphodiesterase function, it is advantageous to use 
PDE5A phosphodiesterase domain for structural determination, because use of the reduced 
sequence simplifies structure determination. To be useful for this purpose, the 
phosphodiesterase domain should be active and/or retain native-type binding, thus 
indicating that the phosphodiesterase domain takes on substantially normal 3D structure. 

[0106] It is to be understood that the crystalline phosphodiesterases and 
phosphodiesterase domains of the invention are not limited to naturally occurring or native 
phosphodiesterase. Indeed, the crystals of the invention include crystals of mutants of 
native phosphodiesterases. Mutants of native phosphodiesterases are obtained by 
replacing at least one amino acid residue in a native phosphodiesterase with a different 
amino acid residue, or by adding or deleting amino acid residues within the native 
polypeptide or at the N- or C -terminus of the native polypeptide, and have substantially 
the same three-dimensional structure as the native phosphodiesterase from which the 
mutant is derived. 

[0107] By having substantially the same three-dimensional structure is meant having a 
set of atomic structure coordinates that have a root-mean-square deviation of less than or 
equal to about 2A when superimposed with the atomic structure coordinates of the native 
phosphodiesterase from which the mutant is derived when at least about 50% to 100% of 
the Ca atoms of the native phosphodiesterase domain are included in the superposition. 

[0108] Amino acid substitutions, deletions and additions which do not significantly 
interfere with the three-dimensional structure of the phosphodiesterase will depend, in 
part, on the region of the phosphodiesterase where the substitution, addition or deletion 
occurs. In highly variable regions of the molecule, non-conservative substitutions as well 
as conservative substitutions may be tolerated without significantly disrupting the three- 
dimensional, structure of the molecule. In highly conserved regions, or regions containing 
significant secondary structure, conservative amino acid substitutions are preferred. Such 
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conserved and variable regions can be identified by sequence alignment of PDE5A with 
other phosphodiesterases. Such alignment of PDE5A phosphodiesterase domain along 
with a number of other phosphodiesterase domains is provided in Table 3. 

[0109] Conservative amino acid substitutions are well known in the art, and include 
substitutions made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the amino acid residues involved. For 
example, negatively charged amino acids include aspartic acid and glutamic acid; 
positively charged amino acids include lysine and arginine; amino acids with uncharged 
polar head groups having similar hydrophilicity values include the following: leucine, 
isoleucine, valine; glycine, alanine; asparagine, glutamine; serine, threonine; 
phenylalanine, tyrosine. Other conservative amino acid substitutions are well known in 
the art. 

[0110] For phosphodiesterases obtained in whole or in part by chemical synthesis, the 
selection of amino acids available for substitution or addition is not limited to the 
genetically encoded amino acids. Indeed, the mutants described herein may contain non- 
genetically encoded amino acids. Conservative amino acid substitutions for many of the 
commonly known non-genetically encoded amino acids are well known in the art. 
Conservative substitutions for other amino acids can be determined based on their physical 
properties as compared to the properties of the genetically encoded amino acids. 

[0111] In some instances, it may be particularly advantageous or convenient to 
substitute, delete and/or add amino acid residues to a native phosphodiesterase in order to 
provide convenient cloning sites in cDNA encoding the polypeptide, to aid in purification 
of the polypeptide, and for crystallization of the polypeptide. Such substitutions, deletions 
and/or additions which do not substantially alter the three dimensional structure of the 
native phosphodiesterase domain will be apparent to those of ordinary skill in the art. 

[0112] It should be noted that the mutants contemplated herein need not all exhibit 
phosphodiesterase activity. Indeed, amino acid substitutions, additions or deletions that 
interfere with the phosphodiesterase activity but which do not significantly alter the three- 
dimensional structure of the domain are specifically contemplated by the invention. Such 
crystalline polypeptides, or the atomic structure coordinates obtained therefrom, can be 



30 



Atty. Dkt. No.: 039363-1106 



used to identify compounds that bind to the native domain. These compounds can affect 
the activity of the native domain. 

[0113] The derivative crystals of the invention can comprise a crystalline 
phosphodiesterase polypeptide in covalent association with one or more heavy metal 
atoms. The polypeptide may correspond to a native or a mutated phosphodiesterase. 
Heavy metal atoms useful for providing derivative crystals include, by way of example 
and not limitation, gold, mercury, selenium, etc. 

[0114] The co-crystals of the invention generally comprise a crystalline 
phosphodiesterase domain polypeptide in association with one or more compounds. The 
association may be covalent or non-covalent. Such compounds include, but are not 
limited to, cofactors, substrates, substrate analogues, inhibitors, allosteric effectors, etc. 

[0115] Exemplary mutations for PDE5A family phosphodiesterases include mutations 
making the phosphodiesterase active site more like the active site of PDE6 or PDE1 1. 
Such insertion is useful, for example, to assist in using PDE5A to model PDE6 or PDE1 1 . 
Mutations at other sites can likewise be carried out, e.g., to make a mutated PDE5A more 
similar to another phosphodiesterase for structure modeling and/or compound fitting 
purposes, such as a phosphodiesterase in the phosphodiesterase domain alignment in Table 
3. 

[0116] In addition to the PDE5A crystal structure described herein, a crystal-based 
structure of PDE5A catalytic domain is described in Brown et al., PCT Application 
PCT/IB02/04426, International Publication WO 03/038080. That structure (and 
associated atomic coordinate sets), as well as other structures and atomic coordinate sets 
that may be obtained can also be used as described herein. 

III. Three Dimensional Structure Determination Using X-ray Crystallography 

[0117] X-ray crystallography is a method of solving the three dimensional structures of 
molecules. The structure of a molecule is calculated from X-ray diffraction patterns using 
a crystal as a diffraction grating. Three dimensional structures of protein molecules arise 
from crystals grown from a concentrated aqueous solution of that protein. The process of 
X-ray crystallography can include the following steps: 

(a) synthesizing and isolating (or otherwise obtaining) a polypeptide; 
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(b) growing a crystal from an aqueous solution comprising the polypeptide with 
or without a modulator; and 

(c) collecting X-ray diffraction patterns from the crystals, determining unit cell 
dimensions and symmetry, determining electron density, fitting the amino 
acid sequence of the polypeptide to the electron density, and refining the 
structure. 

Production of Polypeptides 

[0118] The native and mutated phosphodiesterase polypeptides described herein may be 
chemically synthesized in whole or part using techniques that are well-known in the art 
{see, e.g., Creighton (1983) Biopolymers 22(l):49-58). 

[0119] Alternatively, methods which are well known to those skilled in the art can be 
used to construct expression vectors containing the native or mutated phosphodiesterase 
polypeptide coding sequence and appropriate transcriptional/translational control signals. 
These methods include in vitro recombinant DNA techniques, synthetic techniques and in 
vivo recombination/genetic recombination. See, for example, the techniques described in 
Maniatis, T (1989). Molecular cloning: A laboratory Manual . Cold Spring Harbor 
Laboratory, New York. Cold Spring Harbor Laboratory Press; and Ausubel, F.M. et al. 
(1994) Current Protocols in Molecular Biology . John Wiley & Sons, Secaucus, NJ. 

[0120] A variety of host-expression vector systems may be utilized to express the 
phosphodiesterase coding sequence. These include but are not limited to microorganisms 
such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or 
cosmid DNA expression vectors containing the phosphodiesterase domain coding 
sequence; yeast transformed with recombinant yeast expression vectors containing the 
phosphodiesterase domain coding sequence; insect cell systems infected with recombinant 
virus expression vectors (e.g., baculovirus) containing the phosphodiesterase domain 
coding sequence; plant cell systems infected with recombinant virus expression vectors 
(e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) containing the 
phosphodiesterase domain coding sequence; or animal cell systems. The expression 
elements of these systems vary in their strength and specificities. 
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[0121] Depending on the host/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, may 
be used in the expression vector. For example, when cloning in bacterial systems, 
inducible promoters such as pL of bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid 
promoter) and the like may be used; when cloning in insect cell systems, promoters such 
as the baculovirus polyhedrin promoter may be used; when cloning in plant cell systems, 
promoters derived from the genome of plant cells (e.g., heat shock promoters; the 
promoter for the small subunit of RUBISCO; the promoter for the chlorophyll a/b binding 
protein) or from plant viruses (e.g., the 35S RNA promoter of CaMV; the coat protein 
promoter of TMV) may be used; when cloning in mammalian cell systems, promoters 
derived from the genome of mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter) 
may be used; when generating cell lines that contain multiple copies of the 
phosphodiesterase domain DNA, SV40-, BPV- and EBV-based vectors may be used with 
an appropriate selectable marker. 

[0122] Exemplary methods describing methods of DNA manipulation, vectors, various 
types of cells used, methods of incorporating the vectors into the cells, expression 
techniques, protein purification and isolation methods, and protein concentration methods 
are disclosed in detail in PCT publication WO 96/18738. This publication is incorporated 
herein by reference in its entirety, including any drawings. Those skilled in the art will 
appreciate that such descriptions are applicable to the present invention and can be easily 
adapted to it. 

Crystal Growth 

[0123] Crystals are grown from an aqueous solution containing the purified and 
concentrated polypeptide by a variety of techniques. These techniques include batch, 
liquid, bridge, dialysis, vapor diffusion, and hanging drop methods. McPherson (1982) 
John Wiley, New York; McPherson (1990) Eur. J. Biochem. 189:1-23; Webber (1991) 
Adv. Protein Chem. 41:1-36, incorporated by reference herein in their entireties, including 
all figures, tables, and drawings. 

[0124] The native crystals of the invention are, in general, grown by adding precipitants 
to the concentrated solution of the polypeptide. The precipitants are added at a 
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concentration just below that necessary to precipitate the protein. Water is removed by 
controlled evaporation to produce precipitating conditions, which are maintained until 
crystal growth ceases. 

[0125] For crystals of the invention, exemplary crystallization conditions are described 
in the Examples. Those of ordinary skill in the art will recognize that the exemplary 
crystallization conditions can be varied. Such variations may be used alone or in 
combination. In addition, other crystallization conditions may be found, e.g., by using 
crystallization screening plates to identify such other conditions. Those alternate 
conditions can then be optimized if needed to provide larger or better quality crystals. 

[0126] Derivative crystals of the invention can be obtained by soaking native crystals in 
mother liquor containing salts of heavy metal atoms. It has been found that soaking a 
native crystal in a solution containing about 0.1 mM to about 5 mM thimerosal, 4- 
chloromeruribenzoic acid or KAu(CN)2 for about 2 hr to about 72 hr provides derivative 
crystals suitable for use as isomorphous replacements in determining the X-ray crystal 
structure of PDE5A. 

[0127] Co-crystals of the invention can be obtained by soaking a native crystal in mother 
liquor containing compound that binds the phosphodiesterase, or can be obtained by co- 
crystallizing the phosphodiesterase polypeptide in the presence of a binding compound. 

[0128] Generally, co-crystallization of phosphodiesterase and binding compound can be 
accomplished using conditions identified for crystallizing the corresponding 
phosphodiesterase without binding compound. It is advantageous if a plurality of different 
crystallization conditions have been identified for the phosphodiesterase, and these can be 
tested to determine which condition gives the best co-crystals. It may also be benficial to 
optimize the conditions for co-crystallization. Alternatively, new crystallization 
conditions can be determined for obtaining co-crystals, e.g., by screening for 
crystallization and then optimizing those conditions. Exemplary co-crystallization 
conditions are provided in the Examples. 

Determining Unit Cell Dimensions and the Three Dimensional Structure of a 
Polypeptide or Polypeptide Complex 
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[0129] Once the crystal is grown, it can be placed in a glass capillary tube or other 
mounting device and mounted onto a holding device connected to an X-ray generator and 
an X-ray detection device. Collection of X-ray diffraction patterns are well documented 
by those in the art. See, e.g., Ducruix and Geige, (1992), IRL Press, Oxford, England, and 
references cited therein. A beam of X-rays enters the crystal and then diffracts from the 
crystal. An X-ray detection device can be utilized to record the diffraction patterns 
emanating from the crystal. Although the X-ray detection device on older models of these 
instruments is a piece of film, modern instruments digitally record X-ray diffraction 
scattering. X-ray sources can be of various types, but advantageously, a high intensity 
source is used, e.g., a synchrotron beam source. 

[0130] Methods for obtaining the three dimensional structure of the crystalline form of a 
peptide molecule or molecule complex are well known in the art. See, e.g., Ducruix and 
Geige, (1992), IRL Press, Oxford, England, and references cited therein. The following 
are steps in the process of determining the three dimensional structure of a molecule or 
complex from X-ray diffraction data. 

[0131] After the X-ray diffraction patterns are collected from the crystal, the unit cell 
dimensions and orientation in the crystal can be determined. They can be determined from 
the spacing between the diffraction emissions as well as the patterns made from these 
emissions. The unit cell dimensions are characterized in three dimensions in units of 
Angstroms (one A= 10" 10 meters) and by angles at each vertices. The symmetry of the unit 
cell in the crystals is also characterized at this stage. The symmetry of the unit cell in the 
crystal simplifies the complexity of the collected data by identifying repeating patterns. 
Application of the symmetry and dimensions of the unit cell is described below. 

[0132] Each diffraction pattern emission is characterized as a vector and the data 
collected at this stage of the method determines the amplitude of each vector. The phases 
of the vectors can be determined using multiple techniques. In one method, heavy atoms 
can be soaked into a crystal, a method called isomorphous replacement, and the phases of 
the vectors can be determined by using these heavy atoms as reference points in the X-ray 
analysis. (Otwinowski, (1991), Daresbury, United Kingdom, 80-86). The isomorphous 
replacement method usually utilizes more than one heavy atom derivative. 
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[0133] In another method, the amplitudes and phases of vectors from a crystalline 
polypeptide with an already determined structure can be applied to the amplitudes of the 
vectors from a crystalline polypeptide of unknown structure and consequently determine 
the phases of these vectors. This second method is known as molecular replacement and 
the protein structure which is used as a reference must have a closely related structure to 
the protein of interest. (Naraza (1994) Proteins 1 1 :28 1-296). Thus, the vector 
information from a phosphodiesterase of known structure, such as those reported herein, 
are useful for the molecular replacement analysis of another phosphodiesterase with 
unknown structure. 

[0134] Once the phases of the vectors describing the unit cell of a crystal are determined, 
the vector amplitudes and phases, unit cell dimensions, and unit cell symmetry can be used 
as terms in a Fourier transform function. The Fourier transform function calculates the 
electron density in the unit cell from these measurements. The electron density that 
describes one of the molecules or one of the molecule complexes in the unit cell can be 
referred to as an electron density map. The amino acid structures of the sequence or the 
molecular structures of compounds complexed with the crystalline polypeptide may then 
be fitted to the electron density using a variety of computer programs. This step of the 
process is sometimes referred to as model building and can be accomplished by using 
computer programs such as Turbo/FRODO or "O". (Jones (1985) Methods in Enzymology 
115:157-171). 

[0135] A theoretical electron density map can then be calculated from the amino acid 
structures fit to the experimentally determined electron density. The theoretical and 
experimental electron density maps can be compared to one another and the agreement 
between these two maps can be described by a parameter called an R-factor. A low value 
for an R-factor describes a high degree of overlapping electron density between a 
theoretical and experimental electron density map. 

[0136] The R-factor is then minimized by using computer programs that refine the 
theoretical electron density map. A computer program such as X-PLOR can be used for 
model refinement by those skilled in the art. (Briinger (1992) Nature 355:472-475.) 
Refinement may be achieved in an iterative process. A first step can entail altering the 
conformation of atoms defined in an electron density map. The conformations of the 
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atoms can be altered by simulating a rise in temperature, which will increase the 
vibrational frequency of the bonds and modify positions of atoms in the structure. At a 
particular point in the atomic perturbation process, a force field, which typically defines 
interactions between atoms in terms of allowed bond angles and bond lengths, Van der 
Waals interactions, hydrogen bonds, ionic interactions, and hydrophobic interactions, can 
be applied to the system of atoms. Favorable interactions may be described in terms of 
free energy and the atoms can be moved over many iterations until a free energy minimum 
is achieved. The refinement process can be iterated until the R-factor reaches a minimum 
value. 

[0137] The three dimensional structure of the molecule or molecule complex is 
described by atoms that fit the theoretical electron density characterized by a minimum R- 
value. A file can then be created for the three dimensional structure that defines each atom 
by coordinates in three dimensions. An example of such a structural coordinate file is 
shown in Table 1 . 

IV. Structures of PDE5A 

[0138] The present invention provides high-resolution three-dimensional structures and 
atomic structure coordinates of crystalline PDE5A phosphodiesterase domain and PDE5A 
phosphodiesterase domain co-complexed with exemplary binding compounds as 
determined by X-ray crystallography. The methods used to obtain the structure 
coordinates are provided in the examples. The atomic structure coordinates of crystalline 
PDE5A are listed in Table 1. Co-crystal coordinates can be used in the same way, e.g., in 
the various aspects described herein, as coordinates for the protein by itself. 

[0139] Those having skill in the art will recognize that atomic structure coordinates as 
determined by X-ray crystallography are not without error. Thus, it is to be understood 
that any set of structure coordinates obtained for crystals of PDE5A, whether native 
crystals, phosphodiesterase domain crystals, derivative crystals or co-crystals, that have a 
root mean square deviation ("r.m.s.d.") of less than or equal to about 1.5 A when 
superimposed, using backbone atoms (N, C a , C and 0), on the structure coordinates listed 
in Table 1 are considered to be identical with the structure coordinates listed in the Table 1 
when at least about 50% to 100% of the backbone atoms of PDE5A are included in the 
superposition. 
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[0140] As indicated above, a crystal-based PDE5A catalytic domain structure is 
described in Brown et al., PCT Application PCTVIB02/04426, International Publication 
WO 03/038080. 

V. Uses of the Crystals and Atomic Structure Coordinates 

[0141] The crystals of the invention, and particularly the atomic structure coordinates 
obtained therefrom, have a wide variety of uses. For example, the crystals described 
herein can be used as a starting point in any of the methods of use for phosphodiesterases 
known in the art or later developed. Such methods of use include, for example, 
identifying molecules that bind to the native or mutated catalytic domain of 
phosphodiesterases. The crystals and structure coordinates are particularly useful for 
identifying ligands that modulate phosphodiesterase activity as an approach towards 
developing new therapeutic agents. In particular, the crystals and structural information 
are useful in methods for ligand development utilizing molecular scaffolds. 

[0142] The structure coordinates described herein can be used as phasing models for 
determining the crystal structures of additional phosphodiesterases, as well as the 
structures of co-crystals of such phosphodiesterases with ligands such as inhibitors, 
agonists, antagonists, and other molecules. The structure coordinates, as well as models of 
the three-dimensional structures obtained therefrom, can also be used to aid the elucidation 
of solution-based structures of native or mutated phosphodiesterases, such as those 
obtained via NMR. 

VI. Electronic Representations of Phosphodiesterase Structures 

[0143] Structural information of phosphodiesterases or portions of phosphodiesterases 
(e.g., phosphodiesterase active sites) can be represented in many different ways. 
Particularly useful are electronic representations, as such representations allow rapid and 
convenient data manipulations and structural modifications. Electronic representations can 
be embedded in manydifferent storage or memory media, frequently computer readable 
media. Examples include without limitations, computer random access memory (RAM), 
floppy disk, magnetic hard drive, magnetic tape (analog or digital), compact disk (CD), 
optical disk, CD-ROM, memory card, digital video disk (DVD), and others. The storage 
medium can be separate or part of a computer system. Such a computer system may be a 
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dedicated, special purpose, or embedded system, such as a computer system that forms 
part of an X-ray crystallography system, or may be a general purpose computer (which 
may have data connection with other equipment such as a sensor device in an X-ray 
crystallographic system. In many cases, the information provided by such electronic 
representations can also be represented physically or visually in two or three dimensions, 
e.g., on paper, as a visual display (e.g., on a computer monitor as a two dimensional or 
pseudo-three dimensional image) or as a three dimensional physical model Such physical 
representations can also be used, alone or in connection with electronic representations. 
Exemplary useful representations include, but are not limited to, the following: 

Atomic Coordinate Representation 

[0144] One type of representation is a list or table of atomic coordinates representing 
positions of particular atoms in a molecular structure, portions of a structure, or complex 
(e.g., a co-crystal). Such a representation may also include additional information, for 
example, information about occupancy of particular coordinates. One such atomic 
coordinate representation contains the coordinate information of Table 1 in electronic 
form. 

Energy Surface or Surface of Interaction Representation 

[0145] Another representation is an energy surface representation, e.g., of an active site 
or other binding site, representing an energy surface for electronic and steric interactions. 
Such a representation may also include other features. An example is the inclusion of 
representation of a particular amino acid residue(s) or group(s) on a particular amino acid 
residue(s), e.g. , a residue or group that can participate in H-bonding or ionic interaction. 
Such energy surface representations can be readily generated from atomic coordinate 
representations using any of a variety of available computer programs. 

Structural Representation 

[0146] Still another representation is a structural representation, i.e., a physical 

representation or an electronic representation of such a physical representation. Such a 

structural representation includes representations of relative positions of particular features 

of a molecule or complex, often with linkage between structural features. For example, a 

structure can be represented in which all atoms are linked; atoms other than hydrogen are 

linked; backbone atoms, with or without representation of sidechain atoms that could 
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participate in significant electronic interaction, are linked; among others. However, not all 
features need to be linked. For example, for structural representations of portions of a 
molecule or complex, structural features significant for that feature may be represented 
(e.g., atoms of amino acid residues that can have significant binding interation with a 
ligand at a binding site. Those amino acid residues may not be linked with each other. 

[0147] A structural representation can also be a schematic representation. For example, 
a schematic representation can represent secondary and/or tertiary structure in a schematic 
manner. Within such a schematic representation of a polypeptide, a particular amino acid 
residue(s) or group(s) on a residue(s) can be included, e.g., conserved residues in a binding 
site, and/or residue(s) or group(s) that may interact with binding compounds. Electronic 
structural representations can be generated, for example, from atomic coordinate 
information using computer programs designed for that function and/or by constructing an 
electronic representation with manual input based on interpretation of another form of 
structural information. Physical representations can be created, for example, by printing 
an image of a computer-generated image or by constructing a 3D model. An example of 
such a printed representation is the ribbon diagram presented in Figure 1 . 

VII. Structure Determination for Phosphodiesterases with Unknown Structure 
Using Structural Coordinates 

[0148] Structural coordinates, such as those set forth in Table 1, can be used to 
determine the three dimensional structures of phosphodiesterases with unknown structure. 
The methods described below can apply structural coordinates of a polypeptide with 
known structure to another data set, such as an amino acid sequence, X-ray 
crystallographic diffraction data, or nuclear magnetic resonance (NMR) data. Preferred 
embodiments of the invention relate to determining the three dimensional structures of 
other PDE5 A phosphodiesterases, other phosphodiesterases, and related polypeptides. 

Structures Using Amino Acid Homology 

[0149] Homology modeling is a method of applying structural coordinates of a 

polypeptide of known structure to the amino acid sequence of a polypeptide of unknown 

structure. This method is accomplished using a computer representation of the three 

dimensional structure of a polypeptide or polypeptide complex, the computer 

representation of amino acid sequences of the polypeptides with known and unknown 
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structures, and standard computer representations of the structures of amino acids. 
Homology modeling generally involves (a) aligning the amino acid sequences of the 
polypeptides with and without known structure; (b) transferring the coordinates of the 
conserved amino acids in the known structure to the corresponding amino acids of the 
polypeptide of unknown structure; refining the subsequent three dimensional structure; 
and (d) constructing structures of the rest of the polypeptide. One skilled in the art 
recognizes that conserved amino acids between two proteins can be determined from the 
sequence alignment step in step (a). 

[0150] The above method is well known to those skilled in the art. (Greer (1985) Science 
228:1055; Blundell et al A(1988) Eur. J. Biochem. 172:513. An exemplary computer 
program that can be utilized for homology modeling by those skilled in the art is the 
Homology module in the Insight II modeling package distributed by Accelerys Inc. 

[0151] Alignment of the amino acid sequence is accomplished by first placing the 
computer representation of the amino acid sequence of a polypeptide with known structure 
above the amino acid sequence of the polypeptide of unknown structure. Amino acids in 
the sequences are then compared and groups of amino acids that are homologous (e.g., 
amino acid side chains that are similar in chemical nature - aliphatic, aromatic, polar, or 
charged) are grouped together. This method will detect conserved regions of the 
polypeptides and account for amino acid insertions or deletions. Such alignment and/or 
can also be performed fully electronically using sequence alignment and analyses 
software. 

[0152] Once the amino acid sequences of the polypeptides with known and unknown 
structures are aligned, the structures of the conserved amino acids in the computer 
representation of the polypeptide with known structure are transferred to the 
corresponding amino acids of the polypeptide whose structure is unknown. For example, 
a tyrosine in the amino acid sequence of known structure may be replaced by a 
phenylalanine, the corresponding homologous amino acid in the amino acid sequence of 
unknown structure. 

[0153] The structures of amino acids located in non-conserved regions are to be assigned 
manually by either using standard peptide geometries or molecular simulation techniques, 
such as molecular dynamics. The final step in the process is accomplished by refining the 
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entire structure using molecular dynamics and/or energy minimization. The homology 
modeling method is well known to those skilled in the art and has been practiced using 
different protein molecules. For example, the three dimensional structure of the 
polypeptide corresponding to the catalytic domain of a serine/threonine protein kinase, 
myosin light chain protein kinase, was homology modeled from the cAMP-dependent 
protein kinase catalytic subunit. (Knighton et al (1992) Science 258:130-135.) 

Structures Using Molecular Replacement 

[0154] Molecular replacement is a method of applying the X-ray diffraction data of a 
polypeptide of known structure to the X-ray diffraction data of a polypeptide of unknown 
sequence. This method can be utilized to define the phases describing the X-ray diffraction 
data of a polypeptide of unknown structure when only the amplitudes are known. X-PLOR 
is a commonly utilized computer software package used for molecular replacement. 
Brunger (1992) Nature 355:472-475. AMORE is another program used for molecular 
replacement. Navaza (1994) Acta Crystallogr. A50:157-163. Preferably, the resulting 
structure does not exhibit a root-mean-square deviation of more than 3A. 

[0155] A goal of molecular replacement is to align the positions of atoms in the unit cell 
by matching electron diffraction data from two crystals. A program such as X-PLOR can 
involve four steps. A first step can be to determine the number of molecules in the unit cell 
and define the angles between them. A second step can involve rotating the diffraction 
data to define the orientation of the molecules in the unit cell. A third step can be to 
translate the electron density in three dimensions to correctly position the molecules in the 
unit cell. Once the amplitudes and phases of the X-ray diffraction data is determined, an 
R-factor can be calculated by comparing electron diffraction maps calculated 
experimentally from the reference data set and calculated from the new data set. An R- 
factor between 30-50% indicates that the orientations of the atoms in the unit cell are 
reasonably determined by this method. A fourth step in the process can be to decrease the 
R-factor to roughly 20% by refining the new electron density map using iterative 
refinement techniques described herein and known to those or ordinary skill in the art. 

Structures Using NMR Data 

[0156] Structural coordinates of a polypeptide or polypeptide complex derived from X- 
ray crystallographic techniques can be applied towards the elucidation of three 
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dimensional structures of polypeptides from nuclear magnetic resonance (NMR) data. This 
method is used by those skilled in the art. (Wuthrich, (1986), John Wiley and Sons, New 
York:176-199; Pflugrath et al (1986) J. Mol Biol 189:383-386; Kline et al (1986) J. 
Mol Biol. 189:377-382.) While the secondary structure of a polypeptide is often readily 
determined by utilizing two-dimensional NMR data, the spatial connections between 
individual pieces of secondary structure are not as readily determinable. The coordinates 
defining a three-dimensional structure of a polypeptide derived from X-ray 
crystallographic techniques can guide the NMR spectroscopist to an understanding of 
these spatial interactions between secondary structural elements in a polypeptide of related 
structure. 

[0157] The knowledge of spatial interactions between secondary structural elements can 
greatly simplify Nuclear Overhauser Effect (NOE) data from two-dimensional NMR 
experiments. Additionally, applying the crystallographic coordinates after the 
determination of secondary structure by NMR techniques only simplifies the assignment 
of NOEs relating to particular amino acids in the polypeptide sequence and does not 
greatly bias the NMR analysis of polypeptide structure. Conversely, using the 
crystallographic coordinates to simplify NOE data while determining secondary structure 
of the polypeptide would bias the NMR analysis of protein structure. 

VIII. Structure-Based Design of Modulators of Phosphodiesterase Function 
Utilizing Structural Coordinates 

[0158] Structure-based modulator design and identification methods are powerful 
techniques that can involve searches of computer databases containing a wide variety of 
potential modulators and chemical functional groups. The computerized design and 
identification of modulators is useful as the computer databases contain more compounds 
than the chemical libraries, often by an order of magnitude. For reviews of structure-based 
drug design and identification {see Kuntz et al (1994), Acc. Chem. Res. 27:1 17; Guida 
(1994) Current Opinion in Struc. Biol 4: 777; Colman (1994) Current Opinion in Struc. 
Biol 4: 868). 

[0159] The three dimensional structure of a polypeptide defined by structural 
coordinates can be utilized by these design methods, for example, the structural 
coordinates of Table 1 . In addition, the three dimensional structures of phosphodiesterases 
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determined by the homology, molecular replacement, and NMR techniques described 
herein can also be applied to modulator design and identification methods. 

[0160] For identifying modulators, structural information for a native phosphodiesterase, 
in particular, structural information for the active site of the phosphodiesterase, can be 
used. However, it may be advantageous to utilize structural information from one or more 
co-crystals of the phosphodiesterase with one or more binding compounds. It can also be 
advantageous if the binding compound has a structural core in common with test 
compounds. 

Design by Searching Molecular Data Bases 

[0161] One method of rational design searches for modulators by docking the computer 
representations of compounds from a database of molecules. Publicly available databases 
include, for example: 



a) 


ACD from Molecular Designs Limited 


b) 


NCI from National Cancer Institute 


c) 


CCDC from Cambridge Crystallographic Data Center 


d) 


CAST from Chemical Abstract Service 


e) 


Derwent from Derwent Information Limited 


f) 


Maybridge from Maybridge Chemical Company LTD 


g) 


Aldrich from Aldrich Chemical Company 


h) 


Directory of Natural Products from Chapman & Hall 



[0162] One such data base (ACD distributed by Molecular Designs Limited Information 
Systems) contains compounds that are synthetically derived or are natural products. 
Methods available to those skilled in the art can convert a data set represented in two 
dimensions to one represented in three dimensions. These methods are enabled by such 
computer programs as CONCORD from Tripos Associates or DE-Converter from 
Molecular Simulations Limited. 

[0163] Multiple methods of structure-based modulator design are known to those in the 
art. (Kuntz et al, (1982), J. Mol Biol 162: 269; Kuntz et aZ., (1994), Acc. Chern. Res. 
27: 117; Meng et al, (1992), J. CompL Chem. 13: 505; Bohm, (1994), J. Comp. Aided 
Molec. Design 8: 623.) 
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[0164] A computer program widely utilized by those skilled in the art of rational 
modulator design is DOCK from the University of California in San Francisco. The 
general methods utilized by this computer program and programs like it are described in 
three applications below. More detailed information regarding some of these techniques 
can be found in the Accelerys User Guide, 1995. A typical computer program used for 
this purpose can perform a processes comprising the following steps or functions: 

(a) remove the existing compound from the protein; 

(b) dock the structure of another compound into the active-site using the computer 

program (such as DOCK) or by interactively moving the compound into the 
active-site; 

(c) characterize the space between the compound and the active-site atoms; 

(d) search libraries for molecular fragments which (i) can fit into the empty space 

between the compound and the active-site, and (ii) can be linked to the 
compound; and 

(e) link the fragments found above to the compound and evaluate the new modified 

compound. 

[0165] Part (c) refers to characterizing the geometry and the complementary interactions 
formed between the atoms of the active site and the compounds. A favorable geometric fit 
is attained when a significant surface area is shared between the compound and active-site 
atoms without forming unfavorable steric interactions. One skilled in the art would note 
that the method can be performed by skipping parts (d) and (e) and screening a database of 
many compounds. 

[0166] Structure-based design and identification of modulators of phosphodiesterase 
function can be used in conjunction with assay screening. As large computer databases of 
compounds (around 10,000 compounds) can be searched in a matter of hours or even less, 
the computer-based method can narrow the compounds tested as potential modulators of 
phosphodiesterase function in biochemical or cellular assays. 

[0167] The above descriptions of structure-based modulator design are not all 
encompassing and other methods are reported in the literature and can be used, e.g. : 
(1) CAVEAT: Bartlett et al.,(1989), in Chemical and Biological Problems in 

Molecular Recognition, Roberts, S.M.; Ley, S.V.; Campbell, M.M. eds.; Royal 
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Society of Chemistry: Cambridge, pp.182-196. 

(2) FLOG: Miller et al., (1994), J. Comp. Aided Molec. Design 8:153. 

(3) PRO Modulator: Clark et al., (1995), J. Comp, Aided Molec. Design 9:13. 

(4) MCSS: Miranker and Karplus, (1991), Proteins: Structure, Function, and 

Genetics 11:29. 

(5) AUTODOCK: Goodsell and Olson, (1990), Proteins: Structure, Function, and 

Genetics 8:195. 

(6) GRID: Goodford, (1985), J. Med. Chem. 28:849. 

Design by Modifying Compounds in Complex with PDE5A 

[0168] Another way of identifying compounds as potential modulators is to modify an 
existing modulator in the polypeptide active site. For example, the computer 
representation of modulators can be modified within the computer representation of a 
PDE5A active site. Detailed instructions for this technique can be found, for example, in 
the Accelerys User Manual, 1995 in LUDI. The computer representation of the modulator 
is typically modified by the deletion of a chemical group or groups or by the addition of a 
chemical group or groups. 

[0169] Upon each modification to the compound, the atoms of the modified compound 
and active site can be shifted in conformation and the distance between the modulator and 
the active-site atoms may be scored along with any complementary interactions formed 
between the two molecules. Scoring can be complete when a favorable geometric fit and 
favorable complementary interactions are attained. Compounds that have favorable scores 
are potential modulators. 

Design by Modifying the Structure of Compounds that Bind PDE5A 

[0170] A third method of structure-based modulator design is to screen compounds 
designed by a modulator building or modulator searching computer program. Examples of 
these types of programs can be found in the Molecular Simulations Package, Catalyst. 
Descriptions for using this program are documented in the Molecular Simulations User 
Guide (1995). Other computer programs used in this application are ISIS/HOST, 
ISIS/BASE, ISIS/DRAW) from Molecular Designs Limited and UNITY from Tripos 
Associates. 



46 



Atty. Dkt. No.: 039363-1106 



[0171] These programs can be operated on the structure of a compound that has been 
removed from the active site of the three dimensional structure of a compound- 
phosphodiesterase complex. Operating the program on such a compound is preferable 
since it is in a biologically active conformation. 

[0172] A modulator construction computer program is a computer program that may be 
used to replace computer representations of chemical groups in a compound complexed 
with a phosphodiesterase or other biomolecule with groups from a computer database. A 
modulator searching computer program is a computer program that may be used to search 
computer representations of compounds from a computer data base that have similar three 
dimensional structures and similar chemical groups as compound bound to a particular 
biomolecule. 

[0173] A typical program can operate by using the following general steps: 

(a) map the compounds by chemical features such as by hydrogen bond donors or 

acceptors, hydrophobic/lipophilic sites, positively ionizable sites, or negatively 
ionizable sites; 

(b) add geometric constraints to the mapped features; and 

(c) search databases with the model generated in (b). 

[0174] Those skilled in the art also recognize that not all of the possible chemical 
features of the compound need be present in the model of (b). One can use any subset of 
the model to generate different models for data base searches. 

Modulator Design Using Molecular Scaffolds 

[0175] The present invention can also advantageously utilize methods for designing 
compounds, designated as molecular scaffolds, that can act broadly across families of 
molecules and/or for using a molecular scaffold to design ligands that target individual or 
multiple members of those families. Such design using molecular scaffolds is described in 
Hirth and Milburn, U.S. Patent Application 10/377,268, which is incorporated herein by 
reference in its entirety. Such design and development using molecular scaffolds is 
described, in part, below. 

[0176] In preferred embodiments, the molecules can be proteins and a set of chemical 
compounds can be assembled that have properties such that they are 1) chemically 
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designed to act on certain protein families and/or 2) behave more like molecular scaffolds, 
meaning that they have chemical substructures that make them specific for binding to one 
or more proteins in a family of interest. Alternatively, molecular scaffolds can be 
designed that are preferentially active on an individual target molecule. 

[0177] Useful chemical properties of molecular scaffolds can include one or more of the 
following characteristics, but are not limited thereto: an average molecular weight below 
about 350 daltons, or between from about 150 to about 350 daltons, or from about 150 to 
about 300 daltons; having a clogP below 3; a number of rotatable bonds of less than 4; a 
number of hydrogen bond donors and acceptors below 5 or below 4; a polar surface area 
of less than 50 A 2 ; binding at protein binding sites in an orientation so that chemical 
substituents from a combinatorial library that are attached to the scaffold can be projected 
into pockets in the protein binding site; and possessing chemically tractable structures at 
its substituent attachment points that can be modified, thereby enabling rapid library 
construction. 

[0178] By "clog P" is meant the calculated log P of a compound, "P" referring to the 
partition coefficient between octanol and water. 

[0179] The term "Molecular Polar Surface Area (PSA)" refers to the sum of surface 
contributions of polar atoms (usually oxygens, nitrogens and attached hydrogens) in a 
molecule. The polar surface area has been shown to correlate well with drug transport 
properties, such as intestinal absorption, or blood-brain barrier penetration. 

[0180] Additional useful chemical properties of distinct compounds for inclusion in a 
combinatorial library include the ability to attach chemical moieties to the compound that 
will not interfere with binding of the compound to at least one protein of interest, and that 
will impart desirable properties to the library members, for example, causing the library 
members to be actively transported to cells and/or organs of interest, or the ability to 
attach to a device such as a chromatography column (e.g., a streptavidin column through a 
molecule such as biotin) for uses such as tissue and proteomics profiling purposes. 

[0181] A person of ordinary skill in the art will realize other properties that can be 
desirable for the scaffold or library members to have depending on the particular 
requirements of the use, and that compounds with these properties can also be sought and 
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identified in like manner. Methods of selecting compounds for assay are known to those 
of ordinary skill in the art, for example, methods and compounds described in U.S. Patent 
No. 6,288,234, 6,090,912, 5,840,485, each of which is hereby incorporated by reference in 
its entirety, including all charts and drawings. 

[0182] In various embodiments, the present invention provides methods of designing 
ligands that bind to a plurality of members of a molecular family, where the ligands 
contain a common molecular scaffold. Thus, a compound set can be assayed for binding 
to a plurality of members of a molecular family, e.g., a protein family. One or more 
compounds that bind to a plurality of family members can be identified as molecular 
scaffolds. When the orientation of the scaffold at the binding site of the target molecules 
has been determined and chemically tractable structures have been identified, a set of 
ligands can be synthesized starting with one or a few molecular scaffolds to arrive at a 
plurality of ligands, wherein each ligand binds to a separate target molecule of the 
molecular family with altered or changed binding affinity or binding specificity relative to 
the scaffold. Thus, a plurality of drug lead molecules can be designed to preferentially 
target individual members of a molecular family based on the same molecular scaffold, 
and act on them in a specific manner. 

IX. Binding Assays 

[0183] The methods of the present invention can involve assays that are able to detect 
the binding of compounds to a target molecule. Such binding is at a statistically 
significant level, preferably with a confidence level of at least 90%, more preferably at 
least 95, 97, 98, 99% or greater confidence level that the assay signal represents binding to 
the target molecule, i. e. , is distinguished from background. Preferably controls are used to 
distinguish target binding from non-specific binding. The assays of the present invention 
can also include assaying compounds for low affinity binding to the target molecule. A 
large variety of assays indicative of binding are known for different target types and can 
be used for this invention. Compounds that act broadly across protein families are not 
likely to have a high affinity against individual targets, due to the broad nature of their 
binding. Thus, assays described herein allow for the identification of compounds that bind 
with low affinity, very low affinity, and extremely low affinity. Therefore, potency (or 
binding affinity) is not the primary, nor even the most important, indicia of identification 
of a potentially useful binding compound. Rather, even those compounds that bind with 
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low affinity, very low affinity, or extremely low affinity can be considered as molecular 
scaffolds that can continue to the next phase of the ligand design process. 

[0184] By binding with "low affinity" is meant binding to the target molecule with a 
dissociation constant (kd) of greater than 1 jjM under standard conditions. By binding 
with "very low affinity" is meant binding with a kd of above about 100 under standard 
conditions. By binding with "extremely low affinity" is meant binding at a kd of above 
about 1 mM under standard conditions. By "moderate affinity" is meant binding with a kd 
of from about 200 nM to about 1 jxM under standard conditions. By "moderately high 
affinity" is meant binding at a kd of from about 1 nM to about 200 nM. By binding at 
"high affinity" is meant binding at a kd of below about 1 nM under standard conditions. 
For example, low affinity binding can occur because of a poorer fit into the binding site of 
the target molecule or because of a smaller number of non-covalent bonds, or weaker 
covalent bonds present to cause binding of the scaffold or ligand to the binding site of the 
target molecule relative to instances where higher affinity binding occurs. The standard 
conditions for binding are at pH 7.2 at 37°C for one hour. For example, 100 jil/well can be 
used in HEPES 50 mM buffer at pH 7.2, NaCl 15 mM, ATP 2 ^M, and bovine serum 
albumin 1 ug/well, 37°C for one hour. 

[0185] Binding compounds can also be characterized by their effect on the activity of the 
target molecule. Thus, a "low activity" compound has an inhibitory concentration (IC 5 o) 
or excitation concentration (EC 50) of greater than 1 jiM under standard conditions. By 
"very low activity" is meant an IC50 or EC50 of above 100 \iM under standard conditions. 
By "extremely low activity" is meant an IC50 or EC50 of above 1 mM under standard 
conditions. By "moderate activity" is meant an IC50 or EC50 of 200 nM to 1 jjM under 
standard conditions. By "moderately high activity" is meant an IC50 or EC50 of 1 nM to 
200 nM. By "high activity" is meant an IC50 or EC50 of below 1 nM under standard 
conditions. The IC50 (or EC50) is defined as the concentration of compound at which 50% 
of the activity of the target molecule (e.g., enzyme or other protein) activity being 
measured is lost (or gained) relative to activity when no compound is present. Activity 
can be measured using methods known to those of ordinary skill in the art, e.g., by 
measuring any detectable product or signal produced by occurrence of an enzymatic 
reaction, or other activity by a protein being measured. 
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[0186] By "background signal" in reference to a binding assay is meant the signal that is 
recorded under standard conditions for the particular assay in the absence of a test 
compound, molecular scaffold, or ligand that binds to the target molecule. Persons of 
ordinary skill in the art will realize that accepted methods exist and are widely available 
for determining background signal. 

[0187] By "standard deviation" is meant the square root of the variance. The variance is 
a measure of how spread out a distribution is. It is computed as the average squared 
deviation of each number from its mean. For example, for the numbers 1, 2, and 3, the 
mean is 2 and the variance is: 

a 2 = (l-2) 2 + (2-2) 2 + (3-2) 2 = 0.667 
3 

[0188] To design or discover scaffolds that act broadly across protein families, proteins 
of interest can be assayed against a compound collection or set. The assays can preferably 
be enzymatic or binding assays. In some embodiments it may be desirable to enhance the 
solubility of the compounds being screened and then analyze all compounds that show 
activity in the assay, including those that bind with low affinity or produce a signal with 
greater than about three times the standard deviation of the background signal. The assays 
can be any suitable assay such as, for example, binding assays that measure the binding 
affinity between two binding partners. Various types of screening assays that can be 
useful in the practice of the present invention are known in the art, such as those described 
in U.S. Patent Nos. 5,763,198, 5,747,276, 5,877,007, 6,243,980, 6,294,330, and 6,294,330, 
each of which is hereby incorporated by reference in its entirety, including all charts and 
drawings. 

[0189] In various embodiments of the assays at least one compound, at least about 5%, 
at least about 10%, at least about 15%, at least about 20%, or at least about 25% of the 
compounds can bind with low affinity. In general, up to about 20% of the compounds can 
show activity in the screening assay and these compounds can then be analyzed directly 
with high-throughput co-crystallography, computational analysis to group the compounds 
into classes with common structural properties (e.g., structural core and/or shape and 
polarity characteristics), and the identification of common chemical structures between 
compounds that show activity. 
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[0190] The person of ordinary skill in the art will realize that decisions can be based on 
criteria that are appropriate for the needs of the particular situation, and that the decisions 
can be made by computer software programs. Classes can be created containing almost 
any number of scaffolds, and the criteria selected can be based on increasingly exacting 
criteria until an arbitrary number of scaffolds is arrived at for each class that is deemed to 
be advantageous. 

Surface Plasmon Resonance 

[0191] Binding parameters can be measured using surface plasmon resonance, for 
example, with a BIAcore® chip (Biacore, Japan) coated with immobilized binding 
components. Surface plasmon resonance is used to characterize the microscopic 
association and dissociation constants of reaction between an sFv or other ligand directed 
against target molecules. Such methods are generally described in the following 
references which are incorporated herein by reference. Vely F. et al., (2000) BIAcore® 
analysis to test phosphopeptide-SH2 domain interactions, Methods in Molecular Biology. 
121:313-21; Liparoto et al., (1999) Biosensor analysis of the interleukin-2 receptor 
complex, Journal of Molecular Recognition. 12:316-21; Lipschultz et al., (2000) 
Experimental design for analysis of complex kinetics using surface plasmon resonance, 
Methods. 20(3):310-8; Malmqvist., (1999) BIACORE: an affinity biosensor system for 
characterization of biomolecular interactions, Biochemical Society Transactions 27:335- 
40; Alfthan, (1998) Surface plasmon resonance biosensors as a tool in antibody 
engineering, Biosensors & Bioelectronics. 13:653-63; Fivash et al., (1998) BIAcore for 
macromolecular interaction, Current Opinion in Biotechnology. 9:97-101; Price et al.; 
(1998) Summary report on the ISOBM TD-4 Workshop: analysis of 56 monoclonal 
antibodies against the MUC1 mucin. Tumour Biology 19 Suppl 1:1-20; Malmqvist et al, 
(1997) Biomolecular interaction analysis: affinity biosensor technologies for functional 
analysis of proteins, Current Opinion in Chemical Biology. 1:378-83; O'Shannessy et al., 
(1996) Interpretation of deviations from pseudo-first-order kinetic behavior in the 
characterization of ligand binding by biosensor technology, Analytical Biochemistry. 
236:275-83; Malmborg et al., (1995) BIAcore as a tool in antibody engineering, Journal of 
Immunological Methods. 183:7-13; Van Regenmortel, (1994) Use of biosensors to 
characterize recombinant proteins, Developments in Biological Standardization. 83:143- 
51; and O'Shannessy, (1994) Determination of kinetic rate and equilibrium binding 
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constants for macromolecular interactions: a critique of the surface plasmon resonance 
literature, Current Opinions in Biotechnology, 5:65-71. 

[0192] BIAcore® uses the optical properties of surface plasmon resonance (SPR) to 
detect alterations in protein concentration bound to a dextran matrix lying on the surface 
of a gold/glass sensor chip interface, a dextran biosensor matrix. In brief, proteins are 
covalently bound to the dextran matrix at a known concentration and a ligand for the 
protein is injected through the dextran matrix. Near infrared light, directed onto the 
opposite side of the sensor chip surface is reflected and also induces an evanescent wave 
in the gold film, which in turn, causes an intensity dip in the reflected light at a particular 
angle known as the resonance angle. If the refractive index of the sensor chip surface is 
altered (e.g., by ligand binding to the bound protein) a shift occurs in the resonance angle. 
This angle shift can be measured and is expressed as resonance units (RUs) such that 1000 
RUs is equivalent to a change in surface protein concentration of 1 ng/mm 2 . These 
changes are displayed with respect to time along the y-axis of a sensorgram, which depicts 
the association and dissociation of any biological reaction. 

High Throughput Screening (HTS) Assays 

[0193] HTS typically uses automated assays to search through large numbers of 
compounds for a desired activity. Typically HTS assays are used to find new drugs by 
screening for chemicals that act on a particular enzyme or molecule. For example, if a 
chemical inactivates an enzyme it might prove to be effective in preventing a process in a 
cell which causes a disease. High throughput methods enable researchers to assay 
thousands of different chemicals against each target molecule very quickly using robotic 
handling systems and automated analysis of results. 

[0194] As used herein, "high throughput screening" or "HTS" refers to the rapid in vitro 
screening of large numbers of compounds (libraries); generally tens to hundreds of 
thousands of compounds, using robotic screening assays. Ultra high-throughput Screening 
(uHTS) generally refers to the high-throughput screening accelerated to greater than 
100,000 tests per day. 

[0195] To achieve high-throughput screening, it is advantageous to house samples on a 

multicontainer carrier or platform. A multicontainer carrier facilitates measuring reactions 

of a plurality of candidate compounds simultaneously. Multi-well microplates may be 
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used as the carrier. Such multi-well microplates, and methods for their use in numerous 
assays, are both known in the art and commercially available. 

[0196] Screening assays may include controls for purposes of calibration and 
confirmation of proper manipulation of the components of the assay. Blank wells that 
contain all of the reactants but no member of the chemical library are usually included. As 
another example, a known inhibitor (or activator) of an enzyme for which modulators are 
sought, can be incubated with one sample of the assay, and the resulting decrease (or 
increase) in the enzyme activity used as a comparator or control. It will be appreciated 
that modulators can also be combined with the enzyme activators or inhibitors to find 
modulators which inhibit the enzyme activation or repression that is otherwise caused by 
the presence of the known the enzyme modulator. Similarly, when ligands to a 
sphingolipid target are sought, known ligands of the target can be present in 
control/calibration assay wells. 

Measuring Enzymatic and Binding Reactions During Screening Assays 

[0197] Techniques for measuring the progression of enzymatic and binding reactions, 
e.g., in multicontainer carriers, are known in the art and include, but are not limited to, the 
following. 

[0198] Spectrophotometric and spectrofluorometric assays are well known in the art. 
Examples of such assays include the use of colorimetric assays for the detection of 
peroxides, as disclosed in Example 1(b) and Gordon, A. J. and Ford, R. A., (1972) The 
Chemist's Companion: A Handbook Of Practical Data, Techniques, And References , John 
Wiley and Sons, N.Y., Page 437. 

[0199] Fluorescence spectrometry may be used to monitor the generation of reaction 
products. Fluorescence methodology is generally more sensitive than the absorption 
methodology. The use of fluorescent probes is well known to those skilled in the art. For 
reviews, see Bashford et al., (1987) Spectrophotometry and Spectrofluorometry: A 
Practical Approach , pp. 91-114, IRL Press Ltd.; and Bell, (1981) Spectroscopy In 
Biochemistry, Vol. I, pp. 155-194, CRC Press. 



[0200] In spectrofluorometric methods, enzymes are exposed to substrates that change 
their intrinsic fluorescence when processed by the target enzyme. Typically, the substrate 
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is nonfluorescent and is converted to a fluorophore through one or more reactions. As a 
non-limiting example, SMase activity can be detected using the Amplex® Red reagent 
(Molecular Probes, Eugene, OR). In order to measure sphingomyelinase activity using 
Amplex® Red, the following reactions occur. First, SMase hydrolyzes sphingomyelin to 
yield ceramide and phosphorylcholine. Second, alkaline phosphatase hydrolyzes 
phosphorylcholine to yield choline. Third, choline is oxidized by choline oxidase to 
betaine. Finally, H2O2, in the presence of horseradish peroxidase, reacts with Amplex® 
Red to produce the fluorescent product, Resorufin, and the signal therefrom is detected 
using spectrofluorometry. 

[0201] Fluorescence polarization (FP) is based on a decrease in the speed of molecular 
rotation of a fluorophore that occurs upon binding to a larger molecule, such as a receptor 
protein, allowing for polarized fluorescent emission by the bound ligand. FP is 
empirically determined by measuring the vertical and horizontal components of 
fluorophore emission following excitation with plane polarized light. Polarized emission 
is increased when the molecular rotation of a fluorophore is reduced. A fluorophore 
produces a larger polarized signal when it is bound to a larger molecule (i.e. a receptor), 
slowing molecular rotation of the fluorophore. The magnitude of the polarized signal 
relates quantitatively to the extent of fluorescent ligand binding. Accordingly, polarization 
of the "bound" signal depends on maintenance of high affinity binding. 

[0202] FP is a homogeneous technology and reactions are very rapid, taking seconds to 
minutes to reach equilibrium. The reagents are stable, and large batches may be prepared, 
resulting in high reproducibility. Because of these properties, FP has proven to be highly 
automatable, often performed with a single incubation with a single, premixed, tracer- 
receptor reagent. For a review, see Owickiet al., (1997), Application of Fluorescence 
Polarization Assays in High-Throughput Screening, Genetic Engineering News, 17:27. 

[0203] FP is particularly desirable since its readout is independent of the emission 
intensity (Checovich, W. J., et al., (1995) Nature 375:254-256; Dandliker, W. B., et al., 
(1981) Methods in Enzymology 74:3-28) and is thus insensitive to the presence of colored 
compounds that quench fluorescence emission. FP and FRET (see below) are well-suited 
for identifying compounds that block interactions between sphingolipid receptors and their 
ligands. See, for example, Parker et al., (2000) Development of high throughput screening 
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assays using fluorescence polarization: nuclear receptor-ligand-binding and 
kinase/phosphatase assays, J Biomol Screen 5:77-88. 

[0204] Fluorophores derived from sphingolipids that may be used in FP assays are 
commercially available. For example, Molecular Probes (Eugene, OR) currently sells 
sphingomyelin and one ceramide flurophores. These are, respectively, N-(4,4-difluoro- 
5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene- 3-pentanoyl)sphingosyl phosphocholine 
(BODIPY® FL C5-sphingomyelin); N-(4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s- 
indacene- 3-dodecanoyl)sphingosyl phosphocholine (BODIPY® FL C12-sphingomyelin); 
and N-(4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene- 3-pentanoyl)sphingosine 
(BODIPY® FL C5-ceramide). U.S. Patent No. 4,150,949, (Immunoassay for gentamicin), 
discloses fluorescein-labelled gentamicins, including fluoresceinthiocarbanyl gentamicin. 
Additional fluorophores may be prepared using methods well known to the skilled artisan. 

[0205] Exemplary normal-and-polarized fluorescence readers include the POLARION 
fluorescence polarization system (Tecan AG, Hombrechtikon, Switzerland). General 
multiwell plate readers for other assays are available, such as the VERSAMAX® reader 
and the SPECTRAMAX® multiwell plate spectrophotometer (both from Molecular 
Devices). 

[0206] Fluorescence resonance energy transfer (FRET) is another useful assay for 
detecting interaction and has been described. See, e.g., Heim et al., (1996) Curr. Biol 
6:178-182; Mitra et al., (1996) Gene 173:13-17; and Selvin et al., (1995) Meth. Enzymol 
246:300-345. FRET detects the transfer of energy between two fluorescent substances in 
close proximity, having known excitation and emission wavelengths. As an example, a 
protein can be expressed as a fusion protein with green fluorescent protein (GFP). When 
two fluorescent proteins are in proximity, such as when a protein specifically interacts 
with a target molecule, the resonance energy can be transferred from one excited molecule 
to the other. As a result, the emission spectrum of the sample shifts, which can be 
measured by a fluorometer, such as a fMAX multiwell fluorometer (Molecular Devices, 
Sunnyvale Calif). 

[0207] Scintillation proximity assay (SPA) is a particularly useful assay for detecting an 
interaction with the target molecule. SPA is widely used in the pharmaceutical industry 
and has been described (Hanselman et al., (1997) J. Lipid Res. 38:2365-2373; Kahl et al., 

56 



Atty. Dkt. No.: 039363-1106 



(\996)Anal. Biochem. 243:282-283; Undenfriend et al., (\9%l)Anal Biochem. 161:494- 
500). See also U.S. Patent Nos. 4,626,513 and 4,568,649, and European Patent No. 
0,154,734. One commercially available system uses FLASHPLATE® scintillant-coated 
plates (NEN Life Science Products, Boston, MA). 

[0208] The target molecule can be bound to the scintillator plates by a variety of well 
known means. Scintillant plates are available that are derivatized to bind to fusion 
proteins such as GST, His6 or Flag fusion proteins. Where the target molecule is a protein 
complex or a multimer, one protein or subunit can be attached to the plate first, then the 
other components of the complex added later under binding conditions, resulting in a 
bound complex. 

[0209] In a typical SPA assay, the gene products in the expression pool will have been 
radiolabeled and added to the wells, and allowed to interact with the solid phase, which is 
the immobilized target molecule and scintillant coating in the wells. The assay can be 
measured immediately or allowed to reach equilibrium. Either way, when a radiolabel 
becomes sufficiently close to the scintillant coating, it produces a signal detectable by a 
device such as a TOPCOUNT NXT® microplate scintillation counter (Packard Bioscience 
Co., Meriden Conn.). If a radiolabeled expression product binds to the target molecule, 
the radiolabel remains in proximity to the scintillant long enough to produce a detectable 
signal. 

[0210] In contrast, the labeled proteins that do not bind to the target molecule, or bind 
only briefly, will not remain near the scintillant long enough to produce a signal above 
background. Any time spent near the scintillant caused by random Brownian motion will 
also not result in a significant amount of signal. Likewise, residual unincorporated 
radiolabel used during the expression step may be present, but will not generate significant 
signal because it will be in solution rather than interacting with the target molecule. These 
non-binding interactions will therefore cause a certain level of background signal that can 
be mathematically removed. If too many signals are obtained, salt or other modifiers can 
be added directly to the assay plates until the desired specificity is obtained (Nichols et al., 
(1998) Anal Biochem. 257:112-119). 

Assay Compounds and Molecular Scaffolds 
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[0211] Preferred characteristics of a scaffold include being of low molecular weight 
(e.g., less than 350 Da, or from about 100 to about 350 daltons, or from about 150 to about 
300 daltons). Preferably clog P of a scaffold is from -1 to 8, more preferably less than 6, 
5, or 4, most preferably less than 3. In particular embodiments the clogP is in a range -1 
to an upper limit of 2, 3, 4, 5, 6, or 8; or is in a range of 0 to an upper limit of 2,3, 4, 5, 6, 
or 8. Preferably the number of rotatable bonds is less than 5, more preferably less than 4. 
Preferably the number of hydrogen bond donors and acceptors is below 6, more preferably 
below 5. An additional criterion that can be useful is a polar surface area of less than 5. 
Guidance that can be useful in identifying criteria for a particular application can be found 
in Lipinski et al., (1997) Advanced Drug Delivery Reviews 23 3-25, which is hereby 
incorporated by reference in its entirety. 

[0212] A scaffold may preferably bind to a given protein binding site in a configuration 
that causes substituent moieties of the scaffold to be situated in pockets of the protein 
binding site. Also, possessing chemically tractable groups that can be chemically 
modified, particularly through synthetic reactions, to easily create a combinatorial library 
can be a preferred characteristic of the scaffold. Also preferred can be having positions on 
the scaffold to which other moieties can be attached, which do not interfere with binding 
of the scaffold to the protein(s) of interest but do cause the scaffold to achieve a desirable 
property, for example, active transport of the scaffold to cells and/or organs, enabling the 
scaffold to be attached to a chromatographic column to facilitate analysis, or another 
desirable property. A molecular scaffold can bind to a target molecule with any affinity, 
such as binding at high affinity, moderate affinity, low affinity, very low affinity, or 
extremely low affinity. 

[0213] Thus, the above criteria can be utilized to select many compounds for testing that 
have the desired attributes. Many compounds having the criteria described are available in 
the commercial market, and may be selected for assaying depending on the specific needs 
to which the methods are to be applied. 

[0214] A "compound library" or "library" is a collection of different compounds having 
different chemical structures. A compound library is screenable, that is, the compound 
library members therein may be subject to screening assays. In preferred embodiments, 
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the library members can have a molecular weight of from about 100 to about 350 daltons, 
or from about 150 to about 350 daltons. Examples of libraries are provided aove. 

[0215] Libraries of the present invention can contain at least one compound than binds 
to the target molecule at low affinity. Libraries of candidate compounds can be assayed 
by many different assays, such as those described above, e.g., a fluorescence polarization 
assay. Libraries may consist of chemically synthesized peptides, peptidomimetics, or 
arrays of combinatorial chemicals that are large or small, focused or nonfocused. By 
"focused" it is meant that the collection of compounds is prepared using the structure of 
previously characterized compounds and/or pharmacophores. 

[0216] Compound libraries may contain molecules isolated from natural sources, 
artificially synthesized molecules, or molecules synthesized, isolated, or otherwise 
prepared in such a manner so as to have one or more moieties variable, e.g., moieties that 
are independently isolated or randomly synthesized. Types of molecules in compound 
libraries include but are not limited to organic compounds, polypeptides and nucleic acids 
as those terms are used herein, and derivatives, conjugates and mixtures thereof. 

[0217] Compound libraries of the invention may be purchased on the commercial market 
or prepared or obtained by any means including, but not limited to, combinatorial 
chemistry techniques, fermentation methods, plant and cellular extraction procedures and 
the like (see, e.g., Cwirla et al., (1990) Biochemistry, 87, 6378-6382; Houghten et al., 
(1991) Nature, 354, 84-86; Lam et al., (1991) Nature, 354, 82-84; Brenner et al., (1992) 
Proc. Natl Acad. Sci. USA, 89, 5381-5383; R. A. Houghten, (1993) Trends Genet., 9, 235- 
239; E. R. Felder, (1994) Chimia, 48, 512-541; Gallop et al., (1994) J. Med. Chem., 37, 
1233-1251; Gordon et al., (1994) J. Med. Chem., 37, 1385-1401; Carell et al., (1995) 
Chem. Biol, 3, 171-183; Madden et al., Perspectives in Drug Discovery and Design 2, 
269-282; Lebl et al., (1995) Biopolymers, 37 177-198); small molecules assembled around 
a shared molecular structure; collections of chemicals that have been assembled by various 
commercial and noncommercial groups, natural products; extracts of marine organisms, 
fungi, bacteria, and plants. 

[0218] Preferred libraries can be prepared in a homogenous reaction mixture, and 
separation of unreacted reagents from members of the library is not required prior to 
screening. Although many combinatorial chemistry approaches are based on solid state 
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chemistry, liquid phase combinatorial chemistry is capable of generating libraries (Sun 
CM., (1999) Recent advances in liquid-phase combinatorial chemistry, Combinatorial 
Chemistry & High Throughput Screening. 2:299-318). 

[0219] Libraries of a variety of types of molecules are prepared in order to obtain 
members therefrom having one or more preselected attributes that can be prepared by a 
variety of techniques, including but not limited to parallel array synthesis (Houghton, 
(2000) Annu Rev Pharmacol Toxicol 40:273-82, Parallel array and mixture-based 
synthetic combinatorial chemistry; solution-phase combinatorial chemistry (Merritt, 

(1998) Comb Chem High Throughput Screen l(2):57-72, Solution phase combinatorial 
chemistry, Coe et al., (1998-99) Mol Divers;4(l):3l-$, Solution-phase combinatorial 
chemistry, Sun, (1999) Comb Chem High Throughput Screen 2(6):299-318, Recent 
advances in liquid-phase combinatorial chemistry); synthesis on soluble polymer (Gravert 
et al., (1997) Curr Opin Chem Biol 1(1): 107- 13, Synthesis on soluble polymers: new 
reactions and the construction of small molecules); and the like. See, e.g., Dolle et al., 

(1999) J Comb Chem l(4):235-82, Comprehensive survey of cominatorial library 
synthesis: 1998. Freidinger RM., (1999) Nonpeptidic ligands for peptide and protein 
receptors, Current Opinion in Chemical Biology; and Kundu et al., Prog Drug i?es;53:89- 
156, Combinatorial chemistry: polymer supported synthesis of peptide and non-peptide 
libraries). Compounds may be clinically tagged for ease of identification (Chabala, (1995) 
Curr Opin Biotechnol 6(6):633-9, Solid-phase combinatorial chemistry and novel tagging 
methods for identifying leads). 

[0220] The combinatorial synthesis of carbohydrates and libraries containing 
oligosaccharides have been described (Schweizer et al., (1999) Curr Opin Chem Biol 
3(3):291-8, Combinatorial synthesis of carbohydrates). The synthesis of natural-product 
based compound libraries has been described (Wessjohann, (2000) Curr Opin Chem Biol 
4(3):303-9, Synthesis of natural-product based compound libraries). 

[0221] Libraries of nucleic acids are prepared by various techniques, including by way 
of non-limiting example the ones described herein, for the isolation of aptamers. Libraries 
that include oligonucleotides and polyaminooligonucleotides (Markiewicz et al., (2000) 
Synthetic oligonucleotide combinatorial libraries and their applications, Farmaco. 55:174- 
7) displayed on streptavidin magnetic beads are known. Nucleic acid libraries are known 
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that can be coupled to parallel sampling and be deconvoluted without complex procedures 
such as automated mass spectrometry (Enjalbal C. Martinez J. Aubagnac JL, (2000) 
Mass spectrometry in combinatorial chemistry, Mass Spectrometry Reviews. 19:139-61) 
and parallel tagging. (Perrin DM., Nucleic acids for recognition and catalysis: landmarks, 
limitations, and looking to the future, Combinatorial Chemistry & High Throughput 
Screening 3:243-69). 

[0222] Peptidomimetics are identified using combinatorial chemistry and solid phase 
synthesis (Kim HO. Kahn M., (2000) A merger of rational drug design and combinatorial 
chemistry: development and application of peptide secondary structure mimetics, 
Combinatorial Chemistry & High Throughput Screening 3:167-83; al-Obeidi, (1998) Mol 
Biotechnol 9(3): 205 -23, Peptide and peptidomimetric libraries. Molecular diversity and 
drug design). The synthesis may be entirely random or based in part on a known 
polypeptide. 

[0223] Polypeptide libraries can be prepared according to various techniques. In brief, 
phage display techniques can be used to produce polypeptide ligands (Gram H., (1999) 
Phage display in proteolysis and signal transduction, Combinatorial Chemistry & High 
Throughput Screening. 2:19-28) that may be used as the basis for synthesis of 
peptidomimetics. Polypeptides, constrained peptides, proteins, protein domains, 
antibodies, single chain antibody fragments, antibody fragments, and antibody combining 
regions are displayed on filamentous phage for selection. 

[0224] Large libraries of individual variants of human single chain Fv antibodies have 
been produced. See, e.g., Siegel RW. Allen B. Pavlik P. Marks JD. Bradbury A., (2000) 
Mass spectral analysis of a protein complex using single-chain antibodies selected on a 
peptide target: applications to functional genomics, Journal of Molecular Biology 
302:285-93; Poul MA. Becerril B. Nielsen UB. Morisson P. Marks JD.,(2000) Selection 
of tumor-specific internalizing human antibodies from phage libraries. Source Journal of 
Molecular Biology. 301:1149-61; Amersdorfer P. Marks JD., (2001) Phage libraries for 
generation of anti-botulinum scFv antibodies, Methods in Molecular Biology. 145:219-40; 
Hughes- Jones NC. Bye JM. Gorick BD. Marks JD. Ouwehand WH., (1999) Synthesis of 
Rh Fv phage-antibodies using VH and VL germline genes, British Journal of 
Haematology. 105:81 1-6; McCall AM. Amoroso AR. Sautes C. Marks JD. Weiner LM., 
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(1998) Characterization of anti-mouse Fc gamma RII single-chain Fv fragments derived 
from human phage display libraries, Immunotechnology. 4:71-87; Sheets MD. 
Amersdorfer P. Finnern R. Sargent P. Lindquist E. Schier R. Hemingsen G. Wong C. 
Gerhart JC. Marks JD. Lindquist E., (1998) Efficient construction of a large nonimmune 
phage antibody library: the production of high-affinity human single-chain antibodies to 
protein antigens (published erratum appears in Proc Natl Acad Sci USA 1999 96:795), 
Proc Natl Acad Sci USA 95:6157-62). 

[0225] Focused or smart chemical and pharmacophore libraries can be designed with the 
help of sophisticated strategies involving computational chemistry (e.g., Kundu B. Khare 
SK. Rastogi SK., (1999) Combinatorial chemistry: polymer supported synthesis of 
peptide and non-peptide libraries, Progress in Drug Research 53:89-156) and the use of 
structure-based ligands using database searching and docking, de novo drug design and 
estimation of ligand binding affinities (Joseph-McCarthy D., (1999) Computational 
approaches to structure-based ligand design, Pharmacology & Therapeutics 84:179-91; 
Kirkpatrick DL. Watson S. Ulhaq S., (1999) Structure-based drug design: combinatorial 
chemistry and molecular modeling, Combinatorial Chemistry & High Throughput 
Screening. 2:21 1-21 ; Eliseev AV. Lehn JM., (1999) Dynamic combinatorial chemistry: 
evolutionary formation and screening of molecular libraries, Current Topics in 
Microbiology & Immunology 243:159-72; Bolger et al., (1991) Methods Enz. 203:21-45; 
Martin, (1991) Methods Enz. 203:587-613; Neidle et al., (1991) Methods Enz. 203:433- 
458; U.S. Patent 6,178,384). 

X. Crystallography 

[0226] After binding compounds have been determined, the orientation of compound 
bound to target is determined. Preferably this determination involves crystallography on 
co-crystals of molecular scaffold compounds with target. Most protein crystallographic 
platforms can preferably be designed to analyze up to about 500 co-complexes of 
compounds, ligands, or molecular scaffolds bound to protein targets due to the physical 
parameters of the instruments and convenience of operation. If the number of scaffolds 
that have binding activity exceeds a number convenient for the application of 
crystallography methods, the scaffolds can be placed into groups based on having at least 
one common chemical structure or other desirable characteristics, and representative 
compounds can be selected from one or more of the classes. Classes can be made with 
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increasingly exacting criteria until a desired number of classes (e.g., 500) is obtained. The 
classes can be based on chemical structure similarities between molecular scaffolds in the 
class, e.g., all possess a pyrrole ring, benzene ring, or other chemical feature. Likewise, 
classes can be based on shape characteristics, e.g., space-filling characteristics. 

[0227] The co-crystallography analysis can be performed by co-complexing each 
scaffold with its target at concentrations of the scaffold that showed activity in the 
screening assay. This co-complexing can be accomplished with the use of low percentage 
organic solvents with the target molecule and then concentrating the target with each of 
the scaffolds. In preferred embodiments these solvents are less than 5% organic solvent 
such as dimethyl sulfoxide (DMSO), ethanol, methanol, or ethylene glycol in water or 
another aqueous solvent. Each scaffold complexed to the target molecule can then be 
screened with a suitable number of crystallization screening conditions at both 4 and 20 
degrees. In preferred embodiments, about 96 crystallization screening conditions can be 
performed in order to obtain sufficient information about the co-complexation and 
crystallization conditions, and the orientation of the scaffold at the binding site of the 
target molecule. Crystal structures can then be analyzed to determine how the bound 
scaffold is oriented physically within the binding site or within one or more binding 
pockets of the molecular family member. 

[0228] It is desirable to determine the atomic coordinates of the compounds bound to the 
target proteins in order to determine which is a most suitable scaffold for the protein 
family. X-ray crystallographic analysis is therefore most preferable for determining the 
atomic coordinates. Those compounds selected can be further tested with the application 
of medicinal chemistry. Compounds can be selected for medicinal chemistry testing based 
on their binding position in the target molecule. For example, when the compound binds 
at a binding site, the compound's binding position in the binding site of the target 
molecule can be considered with respect to the chemistry that can be performed on 
chemically tractable structures or sub-structures of the compound, and how such 
modifications on the compound might interact with structures or sub-structures on the 
binding site of the target. Thus, one can explore the binding site of the target and the 
chemistry of the scaffold in order to make decisions on how to modify the scaffold to 
arrive at a ligand with higher potency and/or selectivity. This process allows for more 
direct design of ligands, by utilizing structural and chemical information obtained directly 
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from the co-complex, thereby enabling one to more efficiently and quickly design lead 
compounds that are likely to lead to beneficial drug products. In various embodiments it 
may be desirable to perform co-crystallography on all scaffolds that bind, or only those 
that bind with a particular affinity, for example, only those that bind with high affinity, 
moderate affinity, low affinity, very low affinity, or extremely low affinity. It may also be 
advantageous to perform co-crystallography on a selection of scaffolds that bind with any 
combination of affinities. 

[0229] Standard X-ray protein diffraction studies such as by using a Rigaku RU-200® 
(Rigaku, Tokyo, Japan) with an X-ray imaging plate detector or a synchrotron beam-line 
can be performed on co-crystals and the diffraction data measured on a standard X-ray 
detector, such as a CCD detector or an X-ray imaging plate detector. 

[0230] Performing X-ray crystallography on about 200 co-crystals should generally lead 
to about 50 co-crystals structures, which should provide about 10 scaffolds for validation 
in chemistry, which should finally result in about 5 selective leads for target molecules. 

Virtual Assays 

[0231] Commercially available software that generates three-dimensional graphical 
representations of the complexed target and compound from a set of coordinates provided 
can be used to illustrate and study how a compound is oriented when bound to a target, 
(e.g., QUANTA®, Accelerys, San Diego, CA). Thus, the existence of binding pockets at 
the binding site of the targets can be particularly useful in the present invention. These 
binding pockets are revealed by the crystallographic structure determination and show the 
precise chemical interactions involved in binding the compound to the binding site of the 
target. The person of ordinary skill will realize that the illustrations can also be used to 
decide where chemical groups might be added, substituted, modified, or deleted from the 
scaffold to enhance binding or another desirable effect, by considering where unoccupied 
space is located in the complex and which chemical substructures might have suitable size 
and/or charge characteristics to fill it. The person of ordinary skill will also realize that 
regions within the binding site can be flexible and its properties can change as a result of 
scaffold binding, and that chemical groups can be specifically targeted to those regions to 
achieve a desired effect. Specific locations on the molecular scaffold can be considered 
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with reference to where a suitable chemical substructure can be attached and in which 
conformation, and which site has the most advantageous chemistry available. 

[0232] An understanding of the forces that bind the compounds to the target proteins 
reveals which compounds can most advantageously be used as scaffolds, and which 
properties can most effectively be manipulated in the design of ligands. The person of 
ordinary skill will realize that steric, ionic, hydrogen bond, and other forces can be 
considered for their contribution to the maintenance or enhancement of the target- 
compound complex. Additional data can be obtained with automated computational 
methods, such as docking and/or Free Energy Perturbations (FEP), to account for other 
energetic effects such as desolvation penalties. The compounds selected can be used to 
generate information about the chemical interactions with the target or for. elucidating 
chemical modifications that can enhance selectivity of binding of the compound. 

[0233] Computer models, such as homology models (i.e., based on a known, 
experimentally derived structure) can be constructed using data from the co-crystal 
structures. When the target molecule is a protein or enzyme, preferred co-crystal 
structures for making homology models contain high sequence identity in the binding site 
of the protein sequence being modeled, and the proteins will preferentially also be within 
the same class and/or fold family. Knowledge of conserved residues in active sites of a 
protein class can be used to select homology models that accurately represent the binding 
site. Homology models can also be used to map structural information from a surrogate 
protein where an apo or co-crystal structure exists to the target protein. 

[0234] Virtual screening methods, such as docking, can also be used to predict the 
binding configuration and affinity of scaffolds, compounds, and/or combinatorial library 
members to homology models. Using this data, and carrying out "virtual experiments" 
using computer software can save substantial resources and allow the person of ordinary 
skill to make decisions about which compounds can be suitable scaffolds or ligands, 
without having to actually synthesize the ligand and perform co-crystallization. Decisions 
thus can be made about which compounds merit actual synthesis and co-crystallization. 
An understanding of such chemical interactions aids in the discovery and design of drugs 
that interact more advantageously with target proteins and/or are more selective for one 
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protein family member over others. Thus, applying these principles, compounds with 
superior properties can be discovered. 

[0235] Additives that promote co-crystallization can of course be included in the target 
molecule formulation in order to enhance the formation of co-crystals. In the case of 
proteins or enzymes, the scaffold to be tested can be added to the protein formulation, 
which is preferably present at a concentration of approximately 1 mg/ml. The formulation 
can also contain between 0%-10% (v/v) organic solvent, e.g. DMSO, methanol, ethanol, 
propane diol, or 1,3 dimethyl propane diol (MPD) or some combination of those organic 
solvents. Compounds are preferably solubilized in the organic solvent at a concentration 
of about 10 mM and added to the protein sample at a concentration of about 100 mM, The 
protein-compound complex is then concentrated to a final concentration of protein of from 
about 5 to about 20 mg/ml. The complexation and concentration steps can conveniently 
be performed using a 96-well formatted concentration apparatus (e.g., Amicon Inc., 
Piscataway, NJ). Buffers and other reagents present in the formulation being crystallized 
can contain other components that promote crystallization or are compatible with 
crystallization conditions, such as DTT, propane diol, glycerol. 

[0236] The crystallization experiment can be set-up by placing small aliquots of the 
concentrated protein-compound complex (1 \xl) in a 96 well format and sampling under 96 
crystallization conditions. (Other screening formats can also be used, e.g., plates with 
greater than 96 wells.) Crystals can typically be obtained using standard crystallization 
protocols that can involve the 96 well crystallization plate being placed at different 
temperatures. Co-crystallization varying factors other than temperature can also be 
considered for each protein-compound complex if desirable. For example, atmospheric 
pressure, the presence or absence of light or oxygen, a change in gravity, and many other 
variables can all be tested. The person of ordinary skill in the art will realize other 
variables that can advantageously be varied and considered. 

Ligand Design and Preparation 

[0237] The design and preparation of ligands can be performed with or without 
structural and/or co-crystallization data by considering the chemical structures in common 
between the active scaffolds of a set. In this process structure-activity hypotheses can be 
formed and those chemical structures found to be present in a substantial number of the 
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scaffolds, including those that bind with low affinity, can be presumed to have some effect 
on the binding of the scaffold. This binding can be presumed to induce a desired 
biochemical effect when it occurs in a biological system (e.g., a treated mammal). New or 
modified scaffolds or combinatorial libraries derived from scaffolds can be tested to 
disprove the maximum number of binding and/or structure-activity hypotheses. The 
remaining hypotheses can then be used to design ligands that achieve a desired binding 
and biochemical effect. 

[0238] But in many cases it will be preferred to have co-crystallography data for 
consideration of how to modify the scaffold to achieve the desired binding effect (e.g., 
binding at higher affinity or with higher selectivity). Using the case of proteins and 
enzymes, co-crystallography data shows the binding pocket of the protein with the 
molecular scaffold bound to the binding site, and it will be apparent that a modification 
can be made to a chemically tractable group on the scaffold. For example, a small volume 
of space at a protein binding site or pocket might be filled by modifying the scaffold to 
include a small chemical group that fills the volume. Filling the void volume can be 
expected to result in a greater binding affinity, or the loss of undesirable binding to 
another member of the protein family. Similarly, the co-crystallography data may show 
that deletion of a chemical group on the scaffold may decrease a hindrance to binding and 
result in greater binding affinity or specificity. 

[0239] It can be desirable to take advantage of the presence of a charged chemical group 
located at the binding site or pocket of the protein. For example, a positively charged 
group can be complemented with a negatively charged group introduced on the molecular 
scaffold. This can be expected to increase binding affinity or binding specificity, thereby 
resulting in a more desirable ligand. In many cases, regions of protein binding sites or 
pockets are known to vary from one family member to another based on the amino acid 
differences in those regions. Chemical additions in such regions can result in the creation 
or elimination of certain interactions (e.g., hydrophobic, electrostatic, or entropic) that 
allow a compound to be more specific for one protein target over another or to bind with 
greater affinity, thereby enabling one to synthesize a compound with greater selectivity or 
affinity for a particular family member. Additionally, certain regions can contain amino 
acids that are known to be more flexible than others. This often occurs in amino acids 
contained in loops connecting elements of the secondary structure of the protein, such as 
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alpha helices or beta strands. Additions of chemical moieties can also be directed to these 
flexible regions in order to increase the likelihood of a specific interaction occurring 
between the protein target of interest and the compound. Virtual screening methods can 
also be conducted in silico to assess the effect of chemical additions, subtractions, 
modifications, and/or substitutions on compounds with respect to members of a protein 
family or class. 

[0240] The addition, subtraction, or modification of a chemical structure or sub-structure 
to a scaffold can be performed with any suitable chemical moiety. For example the 
following moieties, which are provided by way of example and are not intended to be 
limiting, can be utilized: hydrogen, alkyl, alkoxy, phenoxy, alkenyl, alkynyl, phenylalkyl, 
hydroxyalkyl, haloalkyl, aryl, arylalkyl, alkyloxy, alkylthio, alkenylthio, phenyl, 
phenylalkyl, phenylalkylthio, hydroxyalkyl-thio, alkylthiocarbbamylthio, cyclohexyl, 
pyridyl, piperidinyl, alkylamino, amino, nitro, mercapto, cyano, hydroxyl, a halogen atom, 
halomethyl, an oxygen atom (e.g., forming a ketone or N-oxide) or a sulphur atom (e.g., 
forming a thiol, thione, di-alkylsulfoxide or sulfone) are all examples of moieties that can 
be utilized. 

[0241] Additional examples of structures or sub-structures that may be utilized are an 
aryl optionally substituted with one, two, or three substituents independently selected from 
the group consisting of alkyl, alkoxy, halogen, trihalomethyl, carboxylate, carboxamide, 
nitro, and ester moieties; an amine of formula -NX2X3, where X2 and X3 are independently 
selected from the group consisting of hydrogen, saturated or unsaturated alkyl, and 
homocyclic or heterocyclic ring moieties; halogen or trihalomethyl; a ketone of formula - 
COX4, where X4 is selected from the group consisting of alkyl and homocyclic or 
heterocyclic ring moieties; a carboxylic acid of formula -(X 5 ) n COOH or ester of formula 
(X 6 ) n COOX 7 , where X 5 , X 6 , and X 7 and are independently selected from the group 
consisting of alkyl and homocyclic or heterocyclic ring moieties and where n is 0 or 1 ; an 
alcohol of formula (Xg) n OH or an alkoxy moiety of formula -(X 8 ) n OX9, where Xg and X9 
are independently selected from the group consisting of saturated or unsaturated alkyl and 
homocyclic or heterocyclic ring moieties, wherein said ring is optionally substituted with 
one or more substituents independently selected from the group consisting of alkyl, 
alkoxy, halogen, trihalomethyl, carboxylate, nitro, and ester and where n is 0 or 1; an 
amide of formula NHCOX10, where X10 is selected from the group consisting of alkyl, 
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hydroxyl, and homocyclic or heterocyclic ring moieties, wherein said ring is optionally 
substituted with one or more substituents independently selected from the group consisting 
of alkyl, alkoxy, halogen, trihalomethyl, carboxylate, nitro, and ester; S02, NXn Xn, 
where Xu and X12 are selected from the group consisting of hydrogen, alkyl, and 
homocyclic or heterocyclic ring moieties; a homocyclic or heterocyclic ring moiety 
optionally substituted with one, two, or three substituents independently selected from the 
group consisting of alkyl, alkoxy, halogen, trihalomethyl, carboxylate, carboxamide, nitro, 
and ester moieties; an aldehyde of formula -CHO; a sulfone of formula -SO2X13, where 
X13 is selected from the group consisting of saturated or unsaturated alkyl and homocyclic 
or heterocyclic ring moieties; and a nitro of formula -NO2. 

Identification of Attachment Sites on Molecular Scaffolds and Ligands 

[0242] In addition to the identification and development of ligands for 
phosphodiesterases and other enzymes, determination of the orientation of a molecular 
scaffold or other binding compound in a binding site allows identification of energetically 
allowed sites for attachment of the binding molecule to another component. For such 
sites, any free energy change associated with the presence of the attached component 
should not destablize the binding of the compound to the phosphodiesterase to an extent 
that will disrupt the binding. Preferably, the binding energy with the attachment should be 
at least 4 kcal/moL, more preferably at least 6, 8, 10, 12, 15, or 20 kcal/mol. Preferably, 
the presence of the attachment at the particular site reduces binding energy by no more 
than 3, 4, 5, 8, 10, 12, or 15 kcal/mol. 

[0243] In many cases, suitable attachment sites will be those that are exposed to solvent 
when the binding compound is bound in the binding site. In some cases, attachment sites 
can be used that will result in small displacements of a portion of the enzyme without an 
excessive energetic cost. Exposed sites can be identified in various ways. For example, 
exposed sites can be identified using a graphic display or 3-dimensional model. In a 
grahic display, such as a computer display, an image of a compound bound in a binding 
site can be visually inspected to reveal atoms or groups on the compound that are exposed 
to solvent and oriented such that attachment at such atom or group would not preclude 
binding of the enzyme and binding compound. Energetic costs of attachment can be 
calculated based on changes or distortions that would be caused by the attachment as well 
as entropic changes. 
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[0244] Many different types of components can be attached. Persons with skill are 
familiar with the chemistries used for various attachments. Examples of components that 
can be attached include, without limitation: solid phase components such as beads, plates, 
chips, and wells; a direct or indirect label; a linker, which may be a traceless linker; among 
others. Such linkers can themselves be attached to other components, e.g., to solid phase 
media, labels, and/or binding moieties. 

[0245] The binding energy of a compound and the effects on binding energy for 
attaching the molecule to another component can be calculated approximately using any of 
a variety of available software or by manual calculation. An example is the following: 

[0246] Calculations were performed to estimate binding energies of different organic 
molecules to two Kinases: PIM-1 and CDK2. The organic molecules considered included 
Staurosporine, identified compounds that bind to PDE5A, and several linkers. 

[0247] Calculated binding energies between protein-ligand complexes were obtained 
using the FlexX score (an implementation of the Bohm scoring function) within the Tripos 
software suite. The form for that equation is shown in the equation below: 

AGbind = AGtr + AGhb + AGion + AGlipo + AGarom + AGrot 

[0248] where: AGtr is a constant term that accounts for the overall loss of rotational and 
translational entropy of the lignand, AGhb accounts for hydrogen bonds formed between 
the ligand and protein, AGion accounts for the ionic interactions between the ligand and 
protein, AGlipo accounts for the lipophilic interaction that corresponds to the protein- 
ligand contact surface, AGarom accounts for interactions between aromatic rings in the 
protein and ligand, and AGrot accounts for the entropic penalty of restricting rotatable 
bonds in the ligand upon binding. 

[0249] This method estimates the free energy that a lead compound should have to a 
target protein for which there is a crystal structure, and it accounts for the entropic penalty 
of flexible linkers. It can therefore be used to estimate the free energy penalty incurred by 
attaching linkers to molecules being screened and the binding energy that a lead 
compound should have in order to overcome the free energy penalty of the linker. The 
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method does not account for solvation and the entropic penalty is likely overestimated for 
cases where the linker is bound to a solid phase through another binding complex, such as 
a biotin:streptavidin complex. 

[0250] Co-crystals were aligned by superimposing residues of PIM-1 with 
corresponding residues in CDK2. The PIM-1 structure used for these calculations was a 
co-crystal of PIM-1 with a binding compound. The CDK2:Staurosporine co-crystal used 
was from the Brookhaven database file laqL Hydrogen atoms were added to the proteins 
and atomic charges were assigned using the AMBER95 parameters within Sybyl. 
Modifications to the compounds described were made within the Sybyl modeling suite 
from Tripos. 

[0251] These calcualtions indicate that the calculated binding energy for compounds that 
bind strongly to a given target (such as Staurosporine:CDK2) can be lower than —25 
kcal/mol, while the calculated binding affinity for a good scaffold or an unoptimized 
binding compound can be in the range of -15 to -20. The free energy penalty for 
attachment to a linker such as the ethylene glycol or hexatriene is estimated as typically 
being in the range of +5 to +15 kcal/mol. 

Linkers 

[0252] Linkers suitable for use in the invention can be of many different types. Linkers 
can be selected for particular applications based on factors such as linker chemistry 
compatible for attachment to a binding compound and to another component utilized in the 
particular application. Additional factors can include, without limitation, linker length, 
linker stability, and ability to remove the linker at an appropriate time. Exemplary linkers 
include, but are not limited to, hexyl, hexatrienyl, ethylene glycol, and peptide linkers. 
Traceless linkers can also be used, e.g., as described in Plunkett, M. J., and Ellman, J. A., 
(1995), J. Org. Chem., 60:6006. 

[0253] Typical functional groups, that are utilized to link binding compound(s), include, 
but not limited to, carboxylic acid, amine, hydroxyl, and thiol. (Examples can be found in 
Solid-supported combinatorial and parallel synthesis of small molecular weight compound 
libraries; (1998) Tetrahedron organic chemistry series Vol.17; Pergamon; p85). 
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[0254] As indicated above, labels can also be attached to a binding compound or to a 
linker attached to a binding compound. Such attachment may be direct (attached directly 
to the binding compound) or indirect (attached to a component that is directly or indirectly 
attached to the binding compound). Such labels allow detection of the compound either 
directly or indirectly. Attachement of labels can be performed using conventional 
chemistries. Labels can include, for example, fluorescent labels, radiolabels, light 
scattering particles, light absorbent particles, magnetic particles, enzymes, and specific 
binding agents (e.g., biotin or an antibody target moiety). 

Solid Phase Media 

[0255] Additional examples of components that can be attached directly or indirectly to 
a binding compound include various solid phase media. Similar to attachment of linkers 
and labels, attachment to solid phase media can be performed using conventional 
chemistries. Such solid phase media can include, for example, small components such as 
beads, nanoparticles, and fibers (e.g., in suspension or in a gel or chromatographic matrix). 
Likewise, solid phase media can include larger objects such as plates, chips, slides, and 
tubes. In many cases, the binding compound will be attached in only a portion of such an 
objects, e.g., in a spot or other local element on a generally flat surface or in a well or 
portion of a well. 

Identification of Biological Agents 

[0256] The posession of structural information about a protein also provides for the 
identification of useful biological agents, such as epitpose for development of antibodies, 
identification of mutation sites expected to affect activity, and identification of attachment 
sites allowing attachment of the protein to materials such as labels, linkers, peptides, and 
solid phase media. 

[0257] Antibodies (Abs) finds multiple applications in a variety of areas including 
biotechnology, medicine and diagnosis, and indeed they are one of the most powerful tools 
for life science research. Abs directed against protein antigens can recognize either linear 
or native three-dimensional (3D) epitopes. The obtention of Abs that recognize 3D 
epitopes require the use of whole native protein (or of a portion that assumes a native 
conformation) as immunogens. Unfortunately, this not always a choice due to various 
technical reasons: for example the native protein is just not available, the protein is toxic, 
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or its is desirable to utilize a high density antigen presentation. In such cases, 
immunization with peptides is the alternative. Of course, Abs generated in this manner 
will recognize linear epitopes, and they might or might not recognize the source native 
protein, but yet they will be useful for standard laboratory applications such as western 
blots. The selection of peptides to use as immunogens can be accomplished by following 
particular selection rules and/or use of epitope prediction software. 

[0258] Though methods to predict antigenic peptides are not infallible, there are several 
rules that can be followed to determine what peptide fragments from a protein are likely to 
be antigenic. These rules are also dictated to increase the likelihood that an Ab to a 
particular peptide will recognize the native protein. 

• 1 . Antigenic peptides should be located in solvent accessible regions and contain 
both hydrophobic and hydrophilic residues. 

o For proteins of known 3D structure, solvent accessibility can be determined 
using a variety of programs such as DSSP, NACESS, or WHATIF, among 
others. 

o If the 3D structure is not known, use any of the following web servers to 
predict accessibilities: PHD , JPRED, PredAcc (c) ACCpro 

• 2. Preferably select peptides lying in long loops connecting Secondary Structure 
(SS) motifs, avoiding peptides located in helical regions. This will increase the 
odds that the Ab recognizes the native protein. Such peptides can, for example, be 
identified from a crystal structure or crystal structure-based homology model. 

o For protein with known 3D coordinates, SS can be obtained from the 
sequence link of the relevant entry at the Brookhaven data bank . The 
PDBsum server also offer SS analysis of pdb records. 

o When no structure is available secondary structure predictions can be 
obtained from any of the following servers: PHD , JPRED . PSI-PRED , 
NNSP, etc 

• 3. When possible, choose peptides that are in the N- and C-terminal region of the 
protein. Because the N- and C- terminal regions of proteins are usually solvent 
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accessible and unstructured, Abs against those regions are also likely to recognize 
the native protein. 

• 4. For cell surface glycoproteins, eliminate from initial peptides those containing 
consesus sites for N-glycosilation. 

o N-glycosilation sites can be detected using Scanprosite , or NetNGlyc 

[0259] In addition, several methods based on various physio-chemical properties of 
experimental determined epitopes (flexibility, hydrophibility, accessibility) have been 
published for the prediction of antigenic determinants and can be used. The antigenic 
index and Preditop are example. 

[0260] Perhaps the simplest method for the prediction of antigenic determinants is that 
of Kolaskar and Tongaonkar, which is based on the occurrence of amino acid residues in 
experimentally determined epitopes. (Kolaskar and Tongaonkar (1990) A semi-empirical 
method for prediction of antigenic determinants on protein antigens. FEBBS Lett. 276(1- 
2): 172-1 74.) The prediction algorithm works as follows: 

• 1 . Calculate the average propensity for each overlapping 7-mer and assign the 
result to the central residue (i+3) of the 7-mer. 

• 2. Calculate the average for the whole protein. 

• 3. (a) If the average for the whole protein is above 1.0 then all residues having 
average propensity above 1 .0 are potentially antigenic. 

• 3. (b) If the average for the whole protein is below 1 .0 then all residues having 
above the average for the whole protein are potentially antigenic. 

• 4. Find 8-mers where all residues are selected by step 3 above (6-mers in the 
original paper) 

[0261] The Kolaskar and Tongaonkar method is also available from the GCG package, 
and it runs using the command egcg. 

[0262] Crystal structures also allow identification of residues at which mutation is likely 
to alter the activity of the protein. Such residues include, for example, residues that 
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interact with susbtrate, conserved active site residues, and residues that are in a region of 
ordered secondary structure of involved in tertiary interactions. The mutations that are 
likely to affect activity will vary for different molecular contexts. Mutations in an active 
site that will affect activity are typically substitutions or deletions that eliminate a charge- 
charge or hydrogen bonding interaction, or introduce a steric interference. Mutations in 
secondary structure regions or molecular interaction regions that are likely to affect 
activity include, for example, substitutions that alter the hydrophobicity/hydrophilicity of a 
region, or that introduce a sufficient strain in a region near or including the active site so 
that critical residue(s) in the active site are displaced. Such substitutions and/or deletions 
and/or insertions are recognized, and the predicted structural and/or energetic effects of 
mutations can be calculated using conventional software. 

IX. Phosphodiesterase Activity Assays 

[0263] A number of different assays for phosphodiesterase activity can be utilized for 
assaying for active modulators and/or determining specificity of a modulator for a 
particular phosphodiesterase or group or phosphodiesterases. In addition to the assay 
mentioned in the Examples below, one of ordinary skill in the art will know of other 
assays that can be utilized and can modify an assay for a particular application. For 
example, numerous papers concerning PDE5 as well as papers concerning other PDEs 
described assays that can be used. For example, useful assays are described in Fryburg et 
al., U.S. Patent Application Publication 2002/0165237, Thompson et al., U.S. Patent 
Application Publication 2002/0009764, Pamukcu et al., U.S. Patent Application 
09/046,739, and Pamukcu et al., U.S. Patent 6,500,610. 

[0264] An assay for phosphodiesterase activity that can be used for PDE5 A, can be 
performed according to the following procedure using purified PDE5A using the 
procedure described in Example 6. 

[0265] Additional alternative assays can employ binding determinations. For example, 
this sort of assay can be formatted either in a fluorescence resonance energy transfer 
(FRET) format, or using an AlphaScreen (amplified /uminescent proximity homogeneous 
assay) format by varying the donor and acceptor reagents that are attached to streptavidin 
or the phosphor-specific antibody. 
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X. Organic Synthetic Techniques 

[0266] The versatility of computer-based modulator design and identification lies in the 
diversity of structures screened by the computer programs. The computer programs can 
search databases that contain very large numbers of molecules and can modify modulators 
already complexed with the enzyme with a wide variety of chemical functional groups. A 
consequence of this chemical diversity is that a potential modulator of phosphodiesterase 
function may take a chemical form that is not predictable. A wide array of organic 
synthetic techniques exist in the art to meet the challenge of constructing these potential 
modulators. Many of these organic synthetic methods are described in detail in standard 
reference sources utilized by those skilled in the art. One example of suh a reference is 
March, 1 994, Advanced Organic Chemistry; Reactions, Mechanisms and Structure , New 
York, McGraw Hill. Thus, the techniques useful to synthesize a potential modulator of 
phosphodiesterase function identified by computer-based methods are readily available to 
those skilled in the art of organic chemical synthesis. 

XI. Administration 

[0267] The methods and compounds will typically be used in therapy for human 
patients. However, they may also be used to treat similar or identical diseases in other 
vertebrates such as other primates, sports animals, and pets such as horses, dogs and cats. 

[0268] Suitable dosage forms, in part, depend upon the use or the route of 
administration, for example, oral, transdermal, transmucosal, or by injection (parenteral). 
Such dosage forms should allow the compound to reach target cells. Other factors are 
well known in the art, and include considerations such as toxicity and dosage forms that 
retard the compound or composition from exerting its effects. Techniques and 
formulations generally may be found in Remington's Pharmaceutical Sciences, 18 th ed., 
Mack Publishing Co., Easton, PA, 1990 (hereby incorporated by reference herein). 

[0269] Compounds can be formulated as pharmaceutically acceptable salts. 
Pharmaceutically acceptable salts are non-toxic salts in the amounts and concentrations at 
which they are administered. The preparation of such salts can facilitate the 
pharmacological use by altering the physical characteristics of a compound without 
preventing it from exerting its physiological effect. Useful alterations in physical 
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properties include lowering the melting point to facilitate transmucosal administration and 
increasing the solubility to facilitate administering higher concentrations of the drug. 

[0270] Pharmaceutically acceptable salts include acid addition salts such as those 
containing sulfate, chloride, hydrochloride, fumarate, maleate, phosphate, sulfamate, 
acetate, citrate, lactate, tartrate, methanesulfonate, ethanesulfonate, benzenesulfonate, p- 
toluenesulfonate, cyclohexylsulfamate and quinate. Pharmaceutically acceptable salts can 
be obtained from acids such as hydrochloric acid, maleic acid, sulfuric acid, phosphoric 
acid, sulfamic acid, acetic acid, citric acid, lactic acid, tartaric acid, malonic acid, 
methanesulfonic acid, ethanesulfonic acid, benzenesulfonic acid, /?-toluenesulfonic acid, 
cyclohexylsulfamic acid, fumaric acid, and quinic acid. 

[0271] Pharmaceutically acceptable salts also include basic addition salts such as those 
containing benzathine, chloroprocaine, choline, diethanolamine, ethylenediamine, 
meglumine, procaine, aluminum, calcium, lithium, magnesium, potassium, sodium, 
ammonium, alkylamine, and zinc, when acidic functional groups, such as carboxylic acid 
or phenol are present. For example, see Remington's Pharmaceutical Sciences . 19 th ed., 
Mack Publishing Co., Easton, PA, Vol. 2, p. 1457, 1995. Such salts can be prepared using 
the appropriate corresponding bases. 

[0272] Pharmaceutically acceptable salts can be prepared by standard techniques. For 
example, the free-base form of a compound is dissolved in a suitable solvent, such as an 
aqueous or aqueous-alcohol in solution containing the appropriate acid and then isolated 
by evaporating the solution. In another example, a salt is prepared by reacting the free 
base and acid in an organic solvent. 

[0273] The pharmaceutically acceptable salt of the different compounds may be present 
as a complex. Examples of complexes include 8-chlorotheophylline complex (analogous 
to, e.g., dimenhydrinate: diphenhydramine 8-chlorotheophylline (1:1) complex; 
Dramamine) and various cyclodextrin inclusion complexes. 

[0274] Carriers or excipients can be used to produce pharmaceutical compositions. The 
carriers or excipients can be chosen to facilitate administration of the compound. 
Examples of carriers include calcium carbonate, calcium phosphate, various sugars such as 
lactose, glucose, or sucrose, or types of starch, cellulose derivatives, gelatin, vegetable 
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oils, polyethylene glycols and physiologically compatible solvents. Examples of 
physiologically compatible solvents include sterile solutions of water for injection (WFI), 
saline solution, and dextrose. 

[0275] The compounds can be administered by different routes including intravenous, 
intraperitoneal, subcutaneous, intramuscular, oral, transmucosal, rectal, or transdermal. 
Oral administration is preferred. For oral administration, for example, the compounds can 
be formulated into conventional oral dosage forms such as capsules, tablets, and liquid 
preparations such as syrups, elixirs, and concentrated drops. 

[0276] Pharmaceutical preparations for oral use can be obtained, for example, by 
combining the active compounds with solid excipients, optionally grinding a resulting 
mixture, and processing the mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such 
as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations, for 
example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, 
methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose (CMC), 
and/or polyvinylpyrrolidone (PVP: povidone). If desired, disintegrating agents may be 
added, such as the cross — linked polyvinylpyrrolidone, agar, or alginic acid, or a salt 
thereof such as sodium alginate. 

[0277] Dragee cores are provided with suitable coatings. For this purpose, concentrated 
sugar solutions may be used, which may optionally contain, for example, gum arabic, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol (PEG), and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent mixtures. Dye-stuffs or 
pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

[0278] Pharmaceutical preparations that can be used orally include push-fit capsules 
made of gelatin ("gelcaps"), as well as soft, sealed capsules made of gelatin, and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active 
ingredients in admixture with filler such as lactose, binders such as starches, and/or 
lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, 
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, 
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liquid paraffin, or liquid polyethylene glycols (PEGs). In addition, stabilizers may be 
added. 

[0279] Alternatively, injection (parenteral administration) may be used, e.g., 
intramuscular, intravenous, intraperitoneal, and/orsubcutaneous. For injection, the 
compounds of the invention are formulated in sterile liquid solutions, preferably in 
physiologically compatible buffers or solutions, such as saline solution, Hank's solution, or 
Ringer's solution. In addition, the compounds may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophilized forms can also be 
produced. 

[0280] Administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, bile salts and fiisidic acid 
derivatives. In addition, detergents may be used to facilitate permeation. Transmucosal 
administration, for example, may be through nasal sprays or suppositories (rectal or 
vaginal). 

[0281] The amounts of various compound to be administered can be determined by 
standard procedures taking into account factors such as the compound ICso, the biological 
half-life of the compound, the age, size, and weight of the patient, and the disorder 
associated with the patient. The importance of these and other factors are well known to 
those of ordinary skill in the art. Generally, a dose will be between about 0.01 and 50 
mg/kg, preferably 0.1 and 20 mg/kg of the patient being treated. Multiple doses may be 
used. 

Manipulation of PDE5A 

[0282] As the full-length coding sequence and amino acid sequence of PDE5A is 
known, cloning, construction of recombinant hPIM-3, production and purification of 
recombinant protein, introduction of PDE5 A into other organisms, and other molecular 
biological manipulations of PDE5A are readily performed. 

[0283] Techniques for the manipulation of nucleic acids, such as, e.g., subcloning, 

labeling probes (e.g., random-primer labeling using Klenow polymerase, nick translation, 

79 



Atty. Dkt. No.: 039363-1106 



amplification), sequencing, hybridization and the like are well disclosed in the scientific 
and patent literature, see, e.g., Sambrook, ed., Molecular Cloning: a Laboratory Manual 
(2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989); Current Protocols in 
Molecular Biology, Ausubel, ed. John Wiley & Sons, Inc., New York (1997); Laboratory 
Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid 
Probes, Part I. Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993). 
[0100] Nucleic acid sequences can be amplified as necessary for further use using 
amplification methods, such as PCR, isothermal methods, rolling circle methods, etc., are 
well known to the skilled artisan. See, e.g., Saiki, "Amplification of Genomic DNA" in 
PCR Protocols, Innis et al., Eds., Academic Press, San Diego, CA 1990, pp 13-20; 
Wharam et al., Nucleic Acids Res. 2001 Jun 1 ;29(1 1):E54-E54; Hafner et al., 
Biotechniques 2001 Apr;30(4):852-6, 858, 860 passim; Zhong et al., Biotechniques 2001 
Apr;30(4):852-6, 858, 860 passim. 

[0284] Nucleic acids, vectors, capsids, polypeptides, and the like can be analyzed and 
quantified by any of a number of general means well known to those of skill in the art. 
These include, e.g., analytical biochemical methods such as NMR, spectrophotometry, 
radiography, electrophoresis, capillary electrophoresis, high performance liquid 
chromatography (HPLC), thin layer chromatography (TLC), and hyperdiffusion 
chromatography, various immunological methods, e.g. fluid or gel precipitin reactions, 
immunodiffusion, immuno-electrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), immuno-fluorescent assays, Southern analysis, Northern 
analysis, dot-blot analysis, gel electrophoresis (e.g., SDS-PAGE), nucleic acid or target or 
signal amplification methods, radiolabeling, scintillation counting, and affinity 
chromatography. 

[0285] Obtaining and manipulating nucleic acids used to practice the methods of the 
invention can be performed by cloning from genomic samples, and, if desired, screening 
and re-cloning inserts isolated or amplified from, e.g., genomic clones or cDNA clones. 
Sources of nucleic acid used in the methods of the invention include genomic or cDNA 
libraries contained in, e.g., mammalian artificial chromosomes (MACs), see, e.g., U.S. 
Patent Nos. 5,721,1 18; 6,025,155; human artificial chromosomes, see, e.g., Rosenfeld 
(1997) Nat. Genet. 15:333-335; yeast artificial chromosomes (YAC); bacterial artificial 
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chromosomes (BAC); PI artificial chromosomes, see, e.g., Woon (1998) Genomics 
50:306-316; Pl-derived vectors (PACs), see, e.g., Kern (1997) Biotechniques 23:120-124; 
cosmids, recombinant viruses, phages or plasmids. 

[0286] The nucleic acids of the invention can be operatively linked to a promoter. A 
promoter can be one motif or an array of nucleic acid control sequences which direct 
transcription of a nucleic acid. A promoter can include necessary nucleic acid sequences 
near the start site of transcription, such as, in the case of a polymerase II type promoter, a 
TATA element. A promoter also optionally includes distal enhancer or repressor elements 
which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter which is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter 
which is under environmental or developmental regulation. A "tissue specific" promoter 
is active in certain tissue types of an organism, but not in other tissue types from the same 
organism. The term "operably linked" refers to a functional linkage between a nucleic 
acid expression control sequence (such as a promoter, or array of transcription factor 
binding sites) and a second nucleic acid sequence, wherein the expression control 
sequence directs transcription of the nucleic acid corresponding to the second sequence. 

[0287] The nucleic acids of the invention can also be provided in expression vectors and 
cloning vehicles, e.g., sequences encoding the polypeptides of the invention. Expression 
vectors and cloning vehicles of the invention can comprise viral particles, baculovirus, 
phage, plasmids, phagemids, cosmids, fosmids, bacterial artificial chromosomes, viral 
DNA (e.g., vaccinia, adenovirus, foul pox virus, pseudorabies and derivatives of SV40), 
PI -based artificial chromosomes, yeast plasmids, yeast artificial chromosomes, and any 
other vectors specific for specific hosts of interest (such as bacillus, Aspergillus and yeast). 
Vectors of the invention can include chromosomal, non-chromosomal and synthetic DNA 
sequences. Large numbers of suitable vectors are known to those of skill in the art, and 
are commercially available. 

[0288] The nucleic acids of the invention can be cloned, if desired, into any of a variety 
of vectors using routine molecular biological methods; methods for cloning in vitro 
amplified nucleic acids are disclosed, e.g., U.S. Pat. No. 5,426,039. To facilitate cloning 
of amplified sequences, restriction enzyme sites can be "built into" a PCR primer pair. 
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Vectors may be introduced into a genome or into the cytoplasm or a nucleus of a cell and 
expressed by a variety of conventional techniques, well described in the scientific and 
patent literature. See, e.g., Roberts (1987) Nature 328:731; Schneider (1995) Protein 
Expr. Purif. 6435:10; Sambrook, Tijssen or Ausubel. The vectors can be isolated from 
natural sources, obtained from such sources as ATCC or GenBank libraries, or prepared 
by synthetic or recombinant methods. For example, the nucleic acids of the invention can 
be expressed in expression cassettes, vectors or viruses which are stably or transiently 
expressed in cells (e.g., episomal expression systems). Selection markers can be 
incorporated into expression cassettes and vectors to confer a selectable phenotype on 
transformed cells and sequences. For example, selection markers can code for episomal 
maintenance and replication such that integration into the host genome is not required. 

[0289] In one aspect, the nucleic acids of the invention are administered in vivo for in 

situ expression of the peptides or polypeptides of the invention. The nucleic acids can be 

administered as "naked DNA" (see, e.g., U.S. Patent No. 5,580,859) or in the form of an 

expression vector, e.g., a recombinant virus. The nucleic acids can be administered by any 

route, including peri- or intra-tumorally, as described below. Vectors administered in vivo 

can be derived from viral genomes, including recombinantly modified enveloped or 

non-enveloped DNA and RNA viruses, preferably selected from baculoviridiae, 

parvoviridiae, picornoviridiae, herpesveridiae, poxviridae, adenoviridiae, or 

picornnaviridiae. Chimeric vectors may also be employed which exploit advantageous 

merits of each of the parent vector properties (See e.g., Feng (1997) Nature Biotechnology 

15:866-870). Such viral genomes may be modified by recombinant DNA techniques to 

include the nucleic acids of the invention; and may be further engineered to be replication 

deficient, conditionally replicating or replication competent. In alternative aspects, vectors 

are derived from the adenoviral (e.g., replication incompetent vectors derived from the 

human adenovirus genome, see, e.g., U.S. Patent Nos. 6,096,718; 6,1 10,458; 6,1 13,913; 

5,631,236); adeno-associated viral and retroviral genomes. Retroviral vectors can include 

those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), 

Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and 

combinations thereof; see, e.g., U.S. Patent Nos. 6,1 17,681; 6,107,478; 5,658,775; 

5,449,614; Buchscher (1992) J. Virol. 66:2731-2739; Johann (1992) J. Virol 66:1635- 

1640). Adeno-associated virus (AAV)-based vectors can be used to transduce cells with 

target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and in in 
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vivo and ex vivo gene therapy procedures; see, e.g., U.S. Patent Nos. 6,1 10,456; 5,474,935; 
Okada (1996) Gene Ther. 3:957-964. 

[0290] The present invention also relates to fusion proteins, and nucleic acids encoding 
them. A polypeptide of the invention can be fused to a heterologous peptide or 
polypeptide, such as N-terminal identification peptides which impart desired 
characteristics, such as increased stability or simplified purification. Peptides and 
polypeptides of the invention can also be synthesized and expressed as fusion proteins 
with one or more additional domains linked thereto for, e.g., producing a more 
immunogenic peptide, to more readily isolate a recombinantly synthesized peptide, to 
identify and isolate antibodies and antibody-expressing B cells, and the like. Detection 
and purification facilitating domains include, e.g., metal chelating peptides such as 
polyhistidine tracts and histidine-tryptophan modules that allow purification on 
immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification 
system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker sequences such 
as Factor Xa or enterokinase (Inyitrogen, San Diego CA) between a purification domain 
and the motif-comprising peptide or polypeptide to facilitate purification. For example, an 
expression vector can include an epitope-encoding nucleic acid sequence linked to six 
histidine residues followed by a thioredoxin and an enterokinase cleavage site (see e.g., 
Williams (1995) Biochemistry 34:1787-1797; Dobeli (1998) Protein Expr. Purif. 12:404- 
414). The histidine residues facilitate detection and purification while the enterokinase 
cleavage site provides a means for purifying the epitope from the remainder of the fusion 
protein. In one aspect, a nucleic acid encoding a polypeptide of the invention is assembled 
in appropriate phase with a leader sequence capable of directing secretion of the translated 
polypeptide or fragment thereof. Technology pertaining to vectors encoding fusion 
proteins and application of fusion proteins are well disclosed in the scientific and patent 
literature, see e.g., Kroll (1993) DNA Cell. Biol 12:441-53. 

[0291] The nucleic acids and polypeptides of the invention can be bound to a solid 
support, e.g., for use in screening and diagnostic methods. Solid supports can include, 
e.g., membranes (e.g., nitrocellulose or nylon), a microtiter dish (e.g., PVC, 
polypropylene, or polystyrene), a test tube (glass or plastic), a dip stick (e.g., glass, PVC, 
polypropylene, polystyrene, latex and the like), a microfuge tube, or a glass, silica, plastic, 
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metallic or polymer bead or other substrate such as paper. One solid support uses a metal 
(e.g., cobalt or nickel)-comprising column which binds with specificity to a histidine tag 
engineered onto a peptide. 

[0292] Adhesion of molecules to a solid support can be direct (i.e., the molecule contacts 
the solid support) or indirect (a "linker" is bound to the support and the molecule of 
interest binds to this linker). Molecules can be immobilized either covalently (e.g., 
utilizing single reactive thiol groups of cysteine residues (see, e.g., Colliuod (1993) 
Bioconjugate Chem. 4:528-536) or non-covalently but specifically (e.g., via immobilized 
antibodies (see, e.g., Schuhmann (1991) Adv. Mater. 3:388-391; Lu (1995) Anal. Chem. 
67:83-87; the biotin/strepavidin system (see, e.g., Iwane (1997) Biophys. Biochem. Res. 
Comm. 230:76-80); metal chelating, e.g., Langmuir-Blodgett films (see, e.g., Ng (1995) 
Langmuir 1 1:4048-55); metal-chelating self-assembled monolayers (see, e.g., Sigal (1996) 
Anal. Chem. 68:490-497) for binding of polyhistidine fusions. 

[0293] Indirect binding can be achieved using a variety of linkers which are 
commercially available. The reactive ends can be any of a variety of functionalities 
including, but not limited to: amino reacting ends such as N-hydroxysuccinimide (NHS) 
active esters, imidoesters, aldehydes, epoxides, sulfonyl halides, isocyanate, 
isothiocyanate, and nitroaryl halides; and thiol reacting ends such as pyridyl disulfides, 
maleimides, thiophthalimides, and active halogens. The heterobifunctional crosslinking 
reagents have two different reactive ends, e.g., an amino-reactive end and a thiol-reactive 
end, while homobifunctional reagents have two similar reactive ends, e.g., 
bismaleimidohexane (BMH) which permits the cross-linking of sulfhydryl-containing 
compounds. The spacer can be of varying length and be aliphatic or aromatic. Examples 
of commercially available homobifunctional cross-linking reagents include, but are not 
limited to, the imidoesters such as dimethyl adipimidate dihydrochloride (DMA); 
dimethyl pimelimidate dihydrochloride (DMP); and dimethyl suberimidate 
dihydrochloride (DMS). Heterobifunctional reagents include commercially available 
active halogen-NHS active esters coupling agents such as N-succinimidyl bromoacetate 
and N-succinimidyl (4-iodoacetyl)aminobenzoate (SIAB) and the sulfosuccinimidyl 
derivatives such as sulfosuccinimidyl(4-iodoacetyl)aminobenzoate (sulfo-SIAB) (Pierce). 
Another group of coupling agents is the heterobifunctional and thiol cleavable agents 
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such as N-succinimidyl 3-(2-pyridyidithio)propionate (SPDP) (Pierce Chemicals, 
Rockford, IL). 

[0294] Antibodies can also be used for binding polypeptides and peptides of the 
invention to a solid support. This can be done directly by binding peptide-specific 
antibodies to the column or it can be done by creating fusion protein chimeras comprising 
motif-containing peptides linked to, e.g., a known epitope (e.g., a tag (e.g., FLAG, myc) or 
an appropriate immunoglobulin constant domain sequence (an "immunoadhesin," see, 
e.g., Capon (1989) Nature 377:525-531 (1989). 

[0295] Nucleic acids or polypeptides of the invention can be immobilized to or applied 
to an array. Arrays can be used to screen for or monitor libraries of compositions (e.g., 
small molecules, antibodies, nucleic acids, etc.) for their ability to bind to or modulate the 
activity of a nucleic acid or a polypeptide of the invention. For example, in one aspect of 
the invention, a monitored parameter is transcript expression of a gene comprising a 
nucleic acid of the invention. One or more, or, all the transcripts of a cell can be measured 
by hybridization of a sample comprising transcripts of the cell, or, nucleic acids 
representative of or complementary to transcripts of a cell, by hybridization to 
immobilized nucleic acids on an array, or "biochip." By using an "array" of nucleic acids 
on a microchip, some or all of the transcripts of a cell can be simultaneously quantified. 
Alternatively, arrays comprising genomic nucleic acid can also be used to determine the 
genotype of a newly engineered strain made by the methods of the invention. Polypeptide 
arrays" can also be used to simultaneously quantify a plurality of proteins. 

[0296] The terms "array" or "microarray" or "biochip" or "chip" as used herein is a 
plurality of target elements, each target element comprising a defined amount of one or 
more polypeptides (including antibodies) or nucleic acids immobilized onto a defined area 
of a substrate surface. In practicing the methods of the invention, any known array and/or 
method of making and using arrays can be incorporated in whole or in part, or variations 
thereof, as disclosed, for example, in U.S. Patent Nos. 6,277,628; 6,277,489; 6,261,776; 
6,258,606; 6,054,270; 6,048,695; 6,045,996; 6,022,963; 6,013,440; 5,965,452; 5,959,098; 
5,856,174; 5,830,645; 5,770,456; 5,632,957; 5,556,752; 5,143,854; 5,807,522; 5,800,992; 
5,744,305; 5,700,637; 5,556,752; 5,434,049; see also, e.g., WO 99/51773; WO 99/09217; 
WO 97/46313; WO 96/17958; see also, e.g., Johnston (1998) Curr. Biol 8:R171-R174; 
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Schummer (1997) Biotechniques 23:1087-1092; Kern (1997) Biotechniques 23:120-124; 
Solinas-Toldo (1997) Genes, Chromosomes & Cancer 20:399-407; Bowtell (1999) Nature 
Genetics Supp. 21 :25-32. See also published U.S. patent applications Nos. 20010018642; 
20010019827; 20010016322; 20010014449; 20010014448; 20010012537; 20010008765. 

Host Cells and Transformed Cells 

[0297] The invention also provides a transformed cell comprising a nucleic acid 
sequence of the invention, e.g., a sequence encoding a polypeptide of the invention, or a 
vector of the invention. The host cell may be any of the host cells familiar to those skilled 
in the art, including prokaryotic cells, eukaryotic cells, such as bacterial cells, fungal cells, 
yeast cells, mammalian cells, insect cells, or plant cells. Exemplary bacterial cells include 
E. coli, Streptomyces, Bacillus subtilis, Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus. Exemplary insect 
cells include Drosophila S2 and Spodoptera Sf9. Exemplary animal cells include CHO, 
COS or Bowes melanoma or any mouse or human cell line. The selection of an 
appropriate host is within the abilities of those skilled in the art. 

[0298] Vectors may be introduced into the host cells using any of a variety of 
techniques, including transformation, transfection, transduction, viral infection, gene guns, 
or Ti-mediated gene transfer. Particular methods include calcium phosphate transfection, 
DEAE-Dextran mediated transfection, lipofection, or electroporation. 

[0299] Engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying the genes of 
the invention. Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter may be induced by appropriate 
means (e.g., temperature shift or chemical induction) and the cells may be cultured for an 
additional period to allow them to produce the desired polypeptide or fragment thereof. 

[0300] Cells can be harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract is retained for further purification. Microbial cells 
employed for expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Such 
methods are well known to those skilled in the art. The expressed polypeptide or fragment 
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can be recovered and purified from recombinant cell cultures by methods including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the polypeptide. If desired, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

[0301] Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts and other cell lines capable of expressing proteins from 
a compatible vector, such as the CI 27, 3T3, CHO, HeLa and BHK cell lines. 

[0302] The constructs in host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. Depending upon the host employed 
in a recombinant production procedure, the polypeptides produced by host cells containing 
the vector may be glycosylated or may be non-glycosylated. Polypeptides of the invention 
may or may not also include an initial methionine amino acid residue. 

[0303] Cell-free translation systems can also be employed to produce a polypeptide of 
the invention. Cell-free translation systems can use mRNAs transcribed from a DNA 
construct comprising a promoter operably linked to a nucleic acid encoding the 
polypeptide or fragment thereof. In some aspects, the DNA construct may be linearized 
prior to conducting an in vitro transcription reaction. The transcribed mRNA is then 
incubated with an appropriate cell-free translation extract, such as a rabbit reticulocyte 
extract, to produce the desired polypeptide or fragment thereof. 

[0304] The expression vectors can contain one or more selectable marker genes to 
provide a phenotypic trait for selection of transformed host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as tetracycline or 
ampicillin resistance in E. coli. 

[0305] For transient expression in mammalian cells, cDNA encoding a polypeptide of 
interest may be incorporated into a mammalian expression vector, e.g. pcDNAl, which is 
available commercially from Invitrogen Corporation (San Diego, Calif., U.S.A.; catalogue 
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number V490-20). This is a multifunctional 4.2 kb plasmid vector designed for cDNA 
expression in eukaryotic systems, and cDNA analysis in prokaryotes, incorporated on the 
vector are the CMV promoter and enhancer, splice segment and polyadenylation signal, an 
SV40 and Polyoma virus origin of replication, and Ml 3 origin to rescue single strand 
DNA for sequencing and mutagenesis, Sp6 and T7 RNA promoters for the production of 
sense and anti-sense RNA transcripts and a Col El -like high copy plasmid origin. A 
polylinker is located appropriately downstream of the CMV promoter (and 3' of the T7 
promoter). 

[0306] The cDNA insert may be first released from the above phagemid incorporated at 
appropriate restriction sites in the pcDNAI polylinker. Sequencing across the junctions 
may be performed to confirm proper insert orientation in pcDNAI. The resulting plasmid 
may then be introduced for transient expression into a selected mammalian cell host, for 
example, the monkey-derived, fibroblast like cells of the COS-1 lineage (available from 
the American Type Culture Collection, Rockville, Md. as ATCC CRL 1650). 

[0307] For transient expression of the protein-encoding DNA, for example, COS-1 cells 
may be transfected with approximately 8 jig DNA per 10 6 COS cells, by DEAE-mediated 
DNA transfection and treated with chloroquine according to the procedures described by 
Sambrook et al, Molecular Cloning: A Laboratory Manual, 1989, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor N.Y, pp. 16.30-16.37. An exemplary method is as 
follows. Briefly, COS-1 cells are plated at a density of 5 x 10 6 cells/dish and then grown 
for 24 hours in FBS-supplemented DMEM/F12 medium. Medium is then removed and 
cells are washed in PBS and then in medium. A transfection solution containing DEAE 
dextran (0.4 mg/ml), 100 jxM chloroquine, 10% NuSerum, DNA (0.4 mg/ml) in 
DMEM/F12 medium is then applied on the cells 10 ml volume. After incubation for 3 
hours at 37 °C, cells are washed in PBS and medium as just described and then shocked 
for 1 minute with 10% DMSO in DMEM/F12 medium. Cells are allowed to grow for 2-3 
days in 10% FBS-supplemented medium, and at the end of incubation dishes are placed on 
ice, washed with ice cold PBS and then removed by scraping. Cells are then harvested by 
centrifugation at 1000 rpm for 10 minutes and the cellular pellet is frozen in liquid 
nitrogen, for subsequent use in protein expression. Northern blot analysis of a thawed 
aliquot of frozen cells may be used to confirm expression of receptor-encoding cDNA in 
cells under storage. 
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[0308] In a like manner, stably transfected cell lines can also prepared, for example, 
using two different cell types as host: CHO Kl and CHO Pro5. To construct these cell 
lines, cDNA coding for the relevant protein may be incorporated into the mammalian 
expression vector pRC/CMV (Invitrogen), which enables stable expression. Insertion at 
this site places the cDNA under the expression control of the cytomegalovirus promoter 
and upstream of the polyadenylation site and terminator of the bovine growth hormone 
gene, and into a vector background comprising the neomycin resistance gene (driven by 
the SV40 early promoter) as selectable marker. 

[0309] An exemplary protocol to introduce plasmids constructed as described above is 
as follows. The host CHO cells are first seeded at a density of 5xl0 5 in 10% FBS- 
supplemented MEM medium. After growth for 24 hours, fresh medium is added to the 
plates and three hours later, the cells are transfected using the calcium phosphate-DNA co- 
precipitation procedure (Sambrook et al, supra). Briefly, 3 |ig of DNA is mixed and 
incubated with buffered calcium solution for 10 minutes at room temperature. An equal 
volume of buffered phosphate solution is added and the suspension is incubated for 1 5 
minutes at room temperature. Next, the incubated suspension is applied to the cells for 4 
hours, removed and cells were shocked with medium containing 15% glycerol. Three 
minutes later, cells are washed with medium and incubated for 24 hours at normal growth 
conditions. Cells resistant to neomycin are selected in 10% FBS-supplemented alpha- 
MEM medium containing G418 (1 mg/ml). Individual colonies of G418-resistant cells are 
isolated about 2-3 weeks later, clonally selected and then propagated for assay purposes. 



EXAMPLES 

[0310] A number of examples involved in the present invention are described below. In 
most cases, alternative techniques could also be used. For example, techniques, methods, 
and other information described in Whitaker et al., U.S. Patent Application 2001/0053780 
can be used in the present invention. Such techniques and information include, without 
limitation, cloning, culturing, purification, assaying, screening, use of modulators, 
sequence information, and information concerning biological role of PDE5A. Each of 
these references is incorporated by reference herein in its entirety, including drawings. 
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EXAMPLE 1 : Cloning of PDE5A Phosphodiesterase Domain 

[0311] PDE5A cDNA sequence was amplified from a Human Kidney QUICK-Clone 
cDNA library (Clontech, #71 12-1) by PCR using the following primers: 



PDE5A-S: 5'-GTCGTAT CATATG TC AGC AGC AG AGG A AG AA AC-3 ' 33 mer 
PDE5A-A: S'-TCTGCA GTCGAC AGGCCACTCAGTTCCGCTTG-3 ' 32 mer 

[0312] The resulting PCR fragment was digested with Ndel and Sail and subcloned into 
the pET15S vector (shown below). In this expression plasmid, residues 531-875 of 
PDE5A are in frame with an N-terminal His-tag followed by a thrombin cleavage site. 

[0313] The sequence of pET15S, with multi-cloning site is shown below: 
T7 promoter 

AGATCTCGATCCCGCGAAATTAATACGACTCACTATAGGGG7^ATTGTGAGCGGATAACAATTCCCC 



RBS 

T C T AGAAATAAT TTTGTT TAAC T T TAAGAAG G AGAT AT AC C 



Ndel 

ATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGCGCGGCAGCCATATG GGATCC GG 
MGSSHHHHHHSSGLVPRGSHM 



StuI Sail 

AATTCAAAGGCCTACGTCGACTAGAGCCTGCAGTCTCGACCATCATCATCATCATCATTAATAAAAGGl 



Spel BamHI 

5GCCGTTACTAGTGGATCCGGCTGCTAACAAAGCCCGAAAGGAAGCTGAGTTGG 



IVEX-3 Primer 



Bpull02 I 
CTGCTGCCAC dT " 

3' -PET Primer 



T7 terminator 

\CCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTG 



[0314] pET15S vector is derived from pET15b vector (Novagen) for bacterial 
expression to produce the proteins with N-terminal His6. This vector was modified by 
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replacement of Ndel-BamHI fragment to others to create a Sail site and stop codon 
(TAG). Vector size is 5814 bp. Insertion can be performed using Ndel-Sall site. 

[0315] The nucleotide and amino acid sequences for the PDE5 A phosphodiesterase 
domain utilized encompass amino acids 531-875 of the amino acid sequence provided in 
Table 4. 

EXAMPLE 2: Expression and Purification of PDE5A Phosphodiesterase Domain 

[0316] PDE 5 A is purified from E. coli cells [BL21(DE3)Codon Plus(RIL) (Novagen)] 
grown in Terrific broth that has been supplemented with 0.2mM Zinc Acetate and ImM 
MgC12 and induced for 16-20h with 1 mM IPTG at 22 C. The centrifuged bacterial pellet 
(typically 200-250g from 16 L) is suspended in lysis buffer (0.1M potassium phosphate 
buffer, pH 8.0, 10% glycerol, 1 mM PMSF). lOOug/ml of lysozyme is added to the lysate 
and the cells are lysed in a Cell Disruptor (MircoFluidics). The cell extract is clarified at 
5000 rpm in a Sorvall SA6000 rotor for lh, and the supernatant is recentrifuged for 
another hour at 17000 rpm in a Sorvall SA 600 rotor. 5 mM imidazole (pH 8.0) is added 
to the clarified supernatant and 2 ml of cobalt beads (50% slurry) is added to each 35 ml of 
extract. The beads are mixed at 4 C for 3-4 h on a Nutator and the beads are recovered by 
centrifugation at 4000 rpm for 3 min. The pelleted beads are washed several times with 
lysis buffer and the beads are packed on a BioRad disposable column. The bound protein 
is eluted with 3-4 column volumes of 0.1M imidazole followed by 0.25M imidazole, both 
prepared in lysis buffer. The protein eluted from the cobalt beads is concentrated on 
Centriprep-10 membranes (Amicon) and separated on a Pharmacia Superdex 200 column 
(26/60) in low salt buffer (25 mM Tris-HCl, pH 8.0, 1 50 mM NaCl, 14 mM beta- 
mercaptoethanol). The uncleaved PDE5A is purified by hydroxyapatite chromatography 
eluted with a phosphate gradient. A final buffer exchange is done on a Pharmacia 
Superdex 200 column (26/60) in 25 mM Tris-HCl, pH 8.0, 150 mM NaCl, 14 mM beta- 
mercaptoethanol. 

Example 3: Crystallization of PDE5A Phosphodiesterase Domain 

[0317] Crystals of purified PDE 5 were grown in 10% (w/v) PEG3000, 100 mM 
phosphate-citrate (pH 4.3), 200 mM NaCl, ImM DTT, ImM Sp-cAMP and 8 mg/ml 
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protein at 4°C, using an Intelliplate (Robbins Scientific, Hampton) by mixing one 
microliter of protein with one microliter of precipitant, also at 4°C. 

Example 4: Diffraction Analysis of PDE5A 

[0318] Synchrotron X-ray data for PDE5A was collected at beamline 8.3.1 of the 
Advanced Light Source (ALS, Lawrence Berkeley National Laboratory, Berkeley) on a 
Quantum 210 charge-coupled device detector (A, = 1.1 OA). The data were processed using 
Mosflm () and scaled and reduced with Scala () in CCP4 (). The data processing process 
was driven by the ELVES automation scripts. 

[0319] A ribbon diagram of the PDE5A catalytic domain is shown in Figure 1 . Atomic 
coordinates for the apo protein are provided in Table 1 . 

Example 5: PDE5A Binding Assays 

[0320] Binding assays can be performed in a variety of ways, including a variety of 
ways known in the art. For example, as indicated above, binding assays can be performed 
using fluorescence resonance energy transfer (FRET) format, or using an AlphaScreen 

[0321] Alternatively, any method which can measure binding of a ligand to the cGMP- 
binding site can be used. For example, a fluorescent ligand can be used. When bound to 
PDE5 A, the emitted fluorescence is polarized. Once displaced by inhibitor binding, the 
polarization decreases. 

[0322] Determination of IC50 for compounds by competitive binding assays. (Note that 
Ki is the dissociation constant for inhibitor binding; K D is the dissociation constant for 
substrate binding.) For this system, the IC50, inhibitor binding constant and substrate 
binding constant can be interrelated according to the following formula: 

[0323] When using radiolabeled substrate Ki = IC50 

1+ [L*]/K D 

[0324] the IC50 ~ K r when there is a small amount of labeled substrate. 
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Example 6: PDE5A Activity Assay 

[0325] As an exemplary phosphodiesterase assay, the effect of potential modulators 
phosphodiesterase activity of PDE5A and other PDEs was measured in the following 
assay format: 



Reagents 



Assay Buffer 

50 mM Tris, 7.5 
8.3 mM MgCl 2 
1.7 mM EGTA 
0.01% BS A 



Store @ 4 degrees 



RNA binding YSi SPA beads 



Beads are 100 mg/ml in water. Dilute to 5 mg/ml in 18 mM Zn using 1M 
ZnAcetate/ZnSC>4 solution(3:l) and water. Store @ 4 degrees. 



Low control compounds Concentration of 20X DMSO Stock 

PDE1B: 8-methoxymethyl IBMX 20 mM 

PDE2A: EHNA 10 mM 

PDE3B: Milrinone 2 mM 

PDE4D: Rolipram 10 mM 

PDE5A: Zaprinast 10 mM 

PDE7B: IBMX 40 mM 

PDE 1 OA : Dipyridamole 4 mM 



Enzyme concentrations (2X final concentration. Diluted in assay buffer) 
PDE IB 50 ng/ml 
PDE2A 50 ng/ml 
PDE3B 10 ng/ml 
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PDE4D 5ng/ml 
PDE5A 20 ng/ml 
PDE7B 25 ng/ml 
PDE10A5ng/ml) 

Radioligands 

[ 3 H] cAMP (Amersham TRK559). Dilute 2000X in assay buffer. 

[ 3 H] cGMP (Amersham TRK392). For PDE5A assay only. Dilute 2000X in assay buffer. 
Protocol 

Make assay plates from 2mM, 96 well master plates by transferring lul of 
compound to 384 well plate using BiomekFx. Final concentration of compounds 
will be -100 \xM. Duplicate assay plates are prepared from each master plate so 
that compounds are assayed in duplicate. 

To column 23 of the assay plate add lul of 20X DMSO stock of appropriate 
control compound. These will be the low controls. 

Columns 1 and 2 of Chembridge library assay plates and columns 21 and 22 of the 
Maybridge library assay plates have lul DMSO. These are the high controls. 
Using BiomekFx, pipet 10 j^l of radioligand into each assay well, then, using the 
same tips, pipet 10 \il of enzyme into each well. 

Seal assay plate with transparent cover. Centrifuge briefly @ 1000 RPM, them 
mix on plate shaker for 10 s. 

- Incubate @ 30° for 30 min. 

- Using BiomekFx, add 10 jitl of bead mixture to each assay well. Mix beads 
thoroughly in reservoir immediately prior to each assay plate addition. 
Re-seal plate with fresh transparent cover. Mix on plate shaker for 10 s, then 
centrifuge for 1 min. @ 1000 RPM. 

Place plates in counting racks. Let stand for > 30 min, then count on Wallac 
TriLux using program 8. 

Analyze data as % inhibition of enzyme activity. Average of high controls = 0% 
inhibition. Average of low controls = 100% inhibition. 
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Example 9: Site-directed Mutagenesis of PDE5A 

[0326] Mutagenesis of PDE5A can be carried out according to the following procedure 
as described in Molecular Biology: Current Innovations and Future Trends. Eds. A.M. 
Griffin and H.G.Griffin. (1995) ISBN 1-898486-01-8, Horizon Scientific Press, PO Box 1, 
Wymondham, Norfolk, U.K., among others. 

[0327] In vitro site-directed mutagenesis is an invaluable technique for studying protein 
structure-function relationships, gene expression and vector modification. Several methods 
have appeared in the literature, but many of these methods require single-stranded DNA as 
the template. The reason for this, historically, has been the need for separating the 
complementary strands to prevent reannealing. Use of PCR in site-directed mutagenesis 
accomplishes strand separation by using a denaturing step to separate the complementing 
strands and allowing efficient polymerization of the PCR primers. PCR site-directed 
methods thus allow site-specific mutations to be incorporated in virtually any double- 
stranded plasmid; eliminating the need for M13-based vectors or single-stranded rescue. 

[0328] It is often desirable to reduce the number of cycles during PCR when performing 
PCR-based site-directed mutagenesis to prevent clonal expansion of any (undesired) 
second-site mutations. Limited cycling which would result in reduced product yield, is 
offset by increasing the starting template concentration. A selection is used to reduce the 
number of parental molecules coming through the reaction. Also, in order to use a single 
PCR primer set, it is desirable to optimize the long PCR method. Further, because of the 
extendase activity of some thermostable polymerases it is often necessary to incorporate 
an end-polishing step into the procedure prior to end-to-end ligation of the PCR-generated 
product containing the incorporated mutations in one or both PCR primers. 

[0329] The following protocol provides a facile method for site-directed mutagenesis 
and accomplishes the above desired features by the incorporation of the following steps: 
(i) increasing template concentration approximately 1 000-fold over conventional PCR 
conditions; (ii) reducing the number of cycles from 25-30 to 5-10; (iii) adding the 
restriction endonuclease Dpnl (recognition target sequence: 5-Gm6ATC-3, where the A 
residue is methylated) to select against parental DNA (note: DNA isolated from almost all 
common strains of E. coli is Dam-methylated at the sequence 5-GATC-3); (iv) using Taq 
Extender in the PCR mix for increased reliability for PCR to 10 kb; (v) using Pfu DNA 
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polymerase to polish the ends of the PCR product, and (vi) efficient intramolecular 
ligation in the presence of T4 DNA ligase. 

[0330] Plasmid template DNA (approximately 0.5 pmole) is added to a PCR cocktail 
containing, in 25 ul of lx mutagenesis buffer: (20 mM Tris HC1, pH 7.5; 8 mM MgC12; 40 
ug/ml BSA); 12-20 pmole of each primer (one of which must contain a 5-prime 
phosphate), 250 uM each dNTP, 2.5 U Taq DNA polymerase, 2.5 U of Taq Extender 
(Stratagene). 

[0331] The PCR cycling parameters are 1 cycle of: 4 min at 94 C, 2 min at 50 C and 2 
min at 72 C; followed by 5-10 cycles of 1 min at 94 C, 2 min at 54 C and 1 min at 72 C 
(step 1). 

[0332] The parental template DNA and the linear, mutagenesis-primer incorporating 
newly synthesized DNA are treated with Dpnl (10 U) and Pfu DNA polymerase (2.5U). 
This results in the Dpnl digestion of the in vivo methylated parental template and hybrid 
DNA and the removal, by Pfu DNA polymerase, of the Taq DNA polymerase-extended 
base(s) on the linear PCR product. 

[0333] The reaction is incubated at 37 C for 30 min and then transferred to 72 C for an 
additional 30 min (step 2). 

[0334] Mutagenesis buffer (lx, 1 15 ul, containing 0.5 mM ATP) is added to the Dpnl- 
digested, Pfu DNA polymerase-polished PCR products. 

[0335] The solution is mixed and 10 ul is removed to a new microfuge tube and T4 
DNA ligase (2-4 U) added. 

[0336] The ligation is incubated for greater than 60 min at 37 C (step 3). 

[0337] The treated solution is transformed into competent E. coli (step 4). 

[0338] In addition to the PCR-based site-directed mutagenesis described above, other 
methods are available. Examples include those described in Kunkel (1985) Proc. Natl. 
Acad. Sci. 82:488-492; Eckstein et al. (1985) Nucl. Acids Res. 13:8764-8785; and using 
the GeneEditor™ Site-Directed Mutageneis Sytem from Promega. 
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[0339] All patents and other references cited in the specification are indicative of the 
level of skill of those skilled in the art to which the invention pertains, and are 
incorporated by reference in their entireties, including any tables and figures, to the same 
extent as if each reference had been incorporated by reference in its entirety individually. 

[0340] One skilled in the art would readily appreciate that the present invention is well 
adapted to obtain the ends and advantages mentioned, as well as those inherent therein. 
The methods, variances, and compositions described herein as presently representative of 
preferred embodiments are exemplary and are not intended as limitations on the scope of 
the invention. Changes therein and other uses will occur to those skilled in the art, which 
are encompassed within the spirit of the invention, are defined by the scope of the claims. 

[0341] It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. For example, variations can be made to crystallization or 
co-crystallization conditions for PDE5 A proteins and/or various phosphodiesterase 
domain sequences can be used. Thus, such additional embodiments are within the scope 
of the present invention and the following claims. 

[0342] The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of and "consisting of may be replaced with either 
of the other two terms. The terms and expressions which have been employed are used as 
terms of description and not of limitation, and there is no intention that in the use of such 
terms and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. Thus, it should be understood that although the present 
invention has been specifically disclosed by preferred embodiments and optional features, 
modification and variation of the concepts herein disclosed may be resorted to by those 
skilled in the art, and that such modifications and variations are considered to be within 
the scope of this invention as defined by the appended claims. 
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[0343] In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize 
that the invention is also thereby described in terms of any individual member or subgroup 
of members of the Markush group or other group. 

[0344] Also, unless indicated to the contrary, where various numerical values are 
provided for embodiments, additional embodiments are described by taking any 2 
different specified values as the endpoints of a range. Such ranges are also within the 
scope of the present described invention. 

[0345] Thus, additional embodiments are within the scope of the invention and within 
the following claims. 
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REFINEMENT. 
PROGRAM 
AUTHORS 



Table 1 

XX-XXX-XX xxxx 



REFMAC 5.1.25 
MURSHUDOV, VAGIN, DODSON 



REFINEMENT TARGET : MAXIMUM LIKELIHOOD 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA (F)) 

COMPLETENESS FOR RANGE (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING + TEST SET) 

R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 

FIT IN THE HIGHEST RESOLUTION BIN. 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH 
BIN RESOLUTION RANGE LOW 
REFLECTION IN BIN 
BIN R VALUE 

BIN FREE R VALUE SET COUNT 
BIN FREE R VALUE 



. 10 
.51 



2 . 
84. 
NONE 
99.35 

23081 



THROUGHOUT 

RANDOM 

0.20593 

0.20404 

0.24234 

5.0 

1227 



(WORKING SET) 
(WORKING SET) 





20 


2 


. 100 


2 


. 155 




1696 


0 


.296 




84 


0 


.336 



NUMBER OF NON- HYDROGEN ATOMS USED IN REFINEMENT. 



ALL ATOMS 



2555 



B VALUES. 
FROM WILSON PLOT (A**2) 
MEAN B VALUE (OVERALL, A**2) 

OVERALL ANISOTROPIC B VALUE. 



NULL 
32 . 944 



Bll 
B22 
B33 
B12 
B13 
B23 



(A**2) 
(A**2) 
(A**2) 
(A**2) 
(A**2) 
(A**2) 



34 
34 
01 



-0. 67 
0.00 
0.00 



ESTIMATED OVERALL COORDINATE ERROR. 
ESU BASED ON R VALUE 
ESU BASED ON FREE R VALUE 
ESU BASED ON MAXIMUM LIKELIHOOD 

ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD 

CORRELATION COEFFICIENTS. 

CORRELATION COEFFICIENT FO-FC : 0.957 

CORRELATION COEFFICIENT FO-FC FREE : 0.94 6 

RMS DEVIATIONS FROM IDEAL VALUES COUNT 

BOND LENGTHS REFINED ATOMS (A) : 2506 

BOND LENGTHS OTHERS (A) : 2282 

BOND ANGLES REFINED ATOMS (DEGREES): 3376 

BOND ANGLES OTHERS (DEGREES) : 5327 



(A" 



(A) 
(A) 
(A) 

■*2) 



0.195 
0.173 
0.131. 
5.040" 



RMS 
. 014 
.002 
.501 
. 947 



WEIGHT 

0.021 

0.020 

1.953 

3.000 
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3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
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3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



TORSION ANGLES, PERIOD 1 (DEGREES) 
CHI RAL- CENTER RESTRAINTS (A**3) 
GENERAL PLANES REFINED ATOMS (A) 
GENERAL PLANES OTHERS (A) 
NON-BONDED CONTACTS REFINED ATOMS (A) 
NON-BONDED CONTACTS OTHERS (A) 
NON-BONDED TORSION OTHERS (A) 
H-BOND (X...Y) REFINED ATOMS (A) 
POTENTIAL METAL-ION REFINED ATOMS (A) 
SYMMETRY VDW REFINED ATOMS (A) 
SYMMETRY VDW OTHERS (A) 
SYMMETRY H-BOND REFINED ATOMS (A) 

ISOTROPIC THERMAL FACTOR RESTRAINTS. 
MAIN-CHAIN BOND REFINED ATOMS (A**2) 
MAIN-CHAIN ANGLE REFINED ATOMS (A**2) 
SIDE-CHAIN BOND REFINED ATOMS <A**2) 
SIDE-CHAIN ANGLE REFINED ATOMS (A**2) 



NCS RESTRAINTS STATISTICS 
NUMBER OF DIFFERENT NCS GROUPS : 

NCS GROUP NUMBER : 
CHAIN NAMES 

NUMBER OF COMPONENTS NCS GROUP 
COMPONENT C SSSEQI TO C 

A 
B 
A 
B 
A 
B 



297 ; 

382 ; 

2718 ; 

489 ; 

643 ; 

2492 ; 

1437 ; 

72 ; 

1 ; 

40 ; 

74 ; 

18 ; 

COUNT 

1502 ; 

2417 ; 

1004 ; 

959 ; 



771 
086 
010 
036 
239 
232 
087 
156 
041 
636 
393 
558 

RMS 
. 616 
. 182 
.967 
. 150 



5. 000 
0.200 
0.020 
0.020 
0.200 
0.200 
0.200 
,200 
,200 
.200 
.200 
.200 



WEIGHT 
1.500 
2 . 000 
3.000 
4 . 500 



1 


A 


534 


1 


B 


534 


2 


A 


672 


2 


B 


672 


3 


A 


687 


3 


B 


687 



GROUP CHAIN 



LOOSE POSITIONAL 
LOOSE THERMAL 



(A) 
(A**2) 



1 

: A B 
: 3 
SSSEQI 

657 

657 

686 

686 

789 

789 

COUNT 
11 
11 



CODE 

6 

6 

6 

6 

6 

6 
RMS 
r 0.14 
: 4.73 



WEIGHT 

5.00 
10.00 



TLS DETAILS 
NUMBER OF TLS GROUPS : 5 

TLS GROUP : 1 
NUMBER OF COMPONENTS GROUP 



COMPONENTS 
RESIDUE RANGE 
RESIDUE RANGE 
RESIDUE RANGE 
RESIDUE RANGE 
ORIGIN FOR THE 
T TENSOR 
Til 



T33 
T13 
TENSOR 
Lll: 4 
L33: 1 
L13: -0 
TENSOR 



0. 1470 
0. 1011 
0.0027 



Sll 
S21 
S31 



0. 
0. 
-0. 



8960 
2544 
5427 

3148 
0297 
0433 



C 
A 
A 
A 
A 

GROUP 

T22 
T12 
T23 

L22 : 
L12 : 
L23: 

S12 
S22 
S32 



SSSEQI 
534 
672 
687 
804 
(A) : 



TO C 
A 
A 
A 
A 

29. 9285 



SSSEQI 
657 
686 
789 
862 

0.5264 



7 .4989 



1360 
0029 
1172 

7854 
7354 
0242 



-0.0276 S13 
-0.3301 S23 
-0.0292 S33 



TLS GROUP : 2 
NUMBER OF COMPONENTS GROUP 
COMPONENTS C SSSEQI 

RESIDUE RANGE : B 686 
ORIGIN FOR THE GROUP (A) : 

• T TENSOR 



TO C 
B 

28.9451 



0. 1681 
0.3850 
0.0152 



SSSEQI 

686 
-15.8239 



8.3689 
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Til: 




0. 3135 


T22: 


0 


.3143 


















REMARK 


3 




T33: 




0.3134 


T12: 


-0 


.0006 


















REMARK 


3 




T13: 




0.0010 


T23: 


0 


.0001 


















REMARK 


3 


L 


TENSOR 


























REMARK 


3 




Lll: 


20.8295 


L22: 


24 


.8368 


















REMARK 


3 




L33: 


60.0722 


L12: 


-4 


. 3409 


















REMARK 


3 




L13: 


-27.2632 


L23: 


-5 


.7801 


















REMARK 


3 


S 


TENSOR 


























REMARK 


3 




Sll: 




0.0102 


S12 : 


-0 


.2842 


S13 


: 0.2176 












REMARK 


3 




S21: 




0. 1711 


S22: 


-0 


. 0107 


S23 


: -0.6694 












REMARK 


3 




S31: 




0.3272 


S32: 


1 


. 0664 


S33 


: 0.0006 
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3 
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BULK 


SOLVENT MODELLING 






















REMARK 
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METHOD USED : BABINET 


MODEL WITH MASK 














REMARK 


3 


PARAMETERS 


FOR MASK CALCULATION 
















REMARK 


3 


VDW 


PROBE 


RADIUS 




1. 


40 


















REMARK 


3 


ION 


PROBE 


RADIUS 




0. 


30 


















REMARK 


3 


SHRINKAGE 


RADIUS 




0. 


30 
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3 
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OTHER REFINEMENT REMARKS: 




















REMARK 


3 


HYDROGENS HAVE BEEN ADDED 


IN THE RIDING POSITIONS 








REMARK 


3 
































LINK 






HIS 


A 


657 










LEU A 672 








gap 




LINK 






GLN 


A 


789 










LEU A 804 








gap 




CRYST1 


96 


411 


96 


411 79. 


026 


90. 


30 90.00 


120.00 F 


62 










SCALE 1 




0.010372 


C 


.005988 


0. 


D00000 




0.00000 














SCALE2 




0.000000 


C 


. 011977 


0. 


D00000 




0. 00000 














SCALE 3 




0.000000 


0 


. 000000 


0. 


D12654 




0.00000 














ATOM 


1 


N 


GLU 


A 


534 


13 


. 637 


-6. 


977 


34 .115 


1 


.00 


46. 


59 




N 


ATOM 


3 


CA 


GLU 


A 


534 


14 


. 989 


-7 . 


244 


33. 549 


1 


.00 


46. 


71 




C 


ATOM 


5 


CB 


GLU 


A 


534 


15 


.061 


-8. 


661 


32. 955 


1 


.00 


46. 


74 




C 


ATOM 


8 


CG 


GLU 


A 


534 


16 


. 480 


-9. 


164 


32. 666 


1 


.00 


46. 


21 




C 


ATOM 


11 


CD 


GLU 


A 


534 


16 


.501 


-10. 


377 


31.756 


1 


.00 


45. 


94 




C 


ATOM 


12 


OE1 


GLU 


A 


534 


15 


. 606 


-11. 


233 


31. 869 


1 


.00 


48. 


25 




O 


ATOM 


13 


OE2 


GLU 


A 


534 


17 


.409 


-10. 


481 


30. 922 


1 


.00 


47. 


41 




O 


ATOM 


14 


C 


GLU 


A 


534 


15 


.355 


-6. 


213 


32. 478 


1 


.00 


47. 


04 




C 


ATOM 


15 


O 


GLU 


A 


534 


14 


.494 


-5. 


684 


.31.767 


1 


.00 


47. 


18 




O 


ATOM 


18 


N 


GLU 


A 


535 


16 


. 652 


-5. 


954 


32.367 


1 


.00 


47. 


23 




N 


ATOM 


20 


CA 


GLU 


A 


535 


17 


.198 


-5. 


021 


31. 392 


1 


.00 


47. 


53 
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Table 2 



PCR from Human Kidney QUICK-Clone cDNA (Clontech, #7112-1) 
Protein in pET15S: 366 aa Mass: 42049.2 pi: 6.68 



1 MGSSHHHHHH SSGLVPRGSH MSAAEEETRE LQSLAAAVVP SAQTLKITDF SFSDFELSDL 

61 ETALCTIRMF TDLNLVQNFQ MKHEVLCRWI LSVKKNYRKN VAYHNWRHAF NTAQCMFAAL 

121 KAGKIQNKLT DLEILALLIA ALSHDLDHRG VNNS YIQRSE HPLAQLYCHS IMEHHHFDQC 

181 LMILNSPGNQ ILSGLS IEEY KTTLKIIKQA ILATDLALYI KRRGEFFELI RKNQFNLEDP 

241 HQKELFLAML MTACDLSAIT KPWPIQQRIA ELVATEFFDQ GDRERKELNI EPTDLMNREK 

301 KNKIPSMQVG FIDAICLQLY EALTHVSEDC FPLLDGCRKN RQKWQALAEQ QEKMLINGES 
3 61 GQAKRN 



PDE5A-S: 5 ' -GTCGTAT CATATG TCAGCAGCAGAGGAAGAAAC-3 ' 33 mer 
PDE5A-A: 5 ' -TCTGCA GTCGAC AGGCCACTCAGTTCCGCTTG-3 ' 32 mer 



pET15S sequence (PCR product; 1070 bp) 

ATATACCATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGCGCGGCAGCCATATG 
t cagcagcaga 

1921 ggaagaaaca agagagctac agtcgttagc ggctgctgtg gtgccatctg cccagaccct 

1981 taaaattact gactttagct tcagtgactt tgagctgtct gatctggaaa cagcactgtg 

2041 tacaattcgg atgtttactg acctcaacct tgtgcagaac ttccagatga aacatgaggt 

2101 tctttgcaga tggattttaa gtgttaagaa gaattatcgg aagaatgttg cctatcataa 

2161 ttggagacat gcctttaata cagctcagtg catgtttgct gctctaaaag caggcaaaat 

2221 tcagaacaag ctgactgacc tggagatact tgcattgctg attgctgcac taagccacga 

2281 tttggatcac cgtggtgtga ataactctta catacagcga agtgaacatc cacttgccca 

2341 gctttactgc cattcaatca tggaacacca tcattttgac cagtgcctga tgattcttaa 

2401 tagtccaggc aatcagattc tcagtggcct ctccattgaa gaatataaga ccacgttgaa 

2461 aataatcaag caagctattt tagctacaga cctagcactg tacattaaga ggcgaggaga 

2521 attttttgaa cttataagaa aaaatcaatt caatttggaa gatcctcatc aaaaggagtt 

2581 gtttttggca atgctgatga cagcttgtga tctttctgca attacaaaac cctggcctat 

2641 tcaacaacgg atagcagaac ttgtagcaac tgaatttttt gatcaaggag acagagagag 

2701 aaaagaactc aacatagaac ccactgatct aatgaacagg gagaagaaaa acaaaatccc 

2761 aagtatgcaa gttgggttca tagatgccat ctgcttgcaa ctgtatgagg ccctgaccca 

2821 cgtgtcagag gactgtttcc ctttgctaga tggctgcaga aagaacaggc agaaatggca 

2881 ggcccttgca gaacagcagg agaagatgct gattaatggg gaaagcggcc aggccaagcg 
2941 gaactgagtg gcct 

GTCGACTAGAGCCTGCAGTCTCGACCATCATCATCATCATCATTAATAAAAGGGCGAATTCCAGCACACT 
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Table 4 



LOCUS PDE5A 3106 bp mRNA linear PRI 05-NOV-2002 

DEFINITION Homo sapiens phosphodiesterase 5A, cGMP-specif ic (PDE5A), 

transcript variant 1, mRNA. 
ACCESSION NM_00108 3 

VERSION NM_001083.2 GI:15812210 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3106) 

AUTHORS Stacey,P., Rulten,S., Dapling,A. and Phillips , S . C . 
TITLE Molecular cloning and expression of human cGMP-binding 

cGMP-specif ic phosphodiesterase (PDE5) 
JOURNAL Biochem. Biophys . Res. Commun. 247 (2), 249-254 (1998) 
MEDLINE 98308101 
PUBMED 9642111 
REFERENCE 2 (bases 1 to 3106) 

AUTHORS Yanaka,N., Kotera, J. , Ohtsuka,A., Akatsuka,H., Imai,Y., 

Michibata, H . , Fu j ishige, K . , Kawai,E., Takebayashi , S . , Okumura,K. 
and Omori , K . 

TITLE Expression, structure and chromosomal localization of the human 

cGMP-binding cGMP-specif ic phosphodiesterase PDE5A gene 

JOURNAL Eur. J. Biochem. 255 (2), 391-399 (1998) 

MEDLINE 98380237 
PUBMED 9716380 
REFERENCE 3 (bases 1 to 3106) 

AUTHORS Loughney,K., Hill,T.R., Florio,V.A., Uher,L., Rosman,G.J., 
Wolda,S.L., Jones, B. A., Howard, M.L., McAllister-Lucas , L . M . , 
Sonnenburg, W . K . , Francis , S . H . , Corbin,J.D., Beavo,J.A. and 
Ferguson, K . 

TITLE Isolation and characterization of cDNAs encoding PDE5A, a human 

cGMP-binding, cGMP-speci f ic 3 f , 5' -cyclic nucleotide 
phosphodiesterase 
JOURNAL Gene 216 (1), 139-147 (1998) 
MEDLINE 98382582 
PUBMED 9714779 
REFERENCE 4 (bases 1 to 3106) 

AUTHORS Kotera, J., Fu j ishige , K . , Imai,Y., Kawai,E., Michibata, H . , 

Akatsuka,H., Yanaka,N. and Omori, K. 
TITLE Genomic origin and transcriptional regulation of two variants of 

cGMP-binding cGMP-specif ic phosphodiesterases 
JOURNAL Eur. J. Biochem. 262 (3), 866-873 (1999) 
MEDLINE 99339957 
PUBMED 10411650 
REFERENCE 5 (bases 1 to 3106) 

AUTHORS Lin,C.S., Lau , A . , Tu,R. and Lue, T . F. 

TITLE Identification of three alternative first exons and an intronic 

promoter of human PDE5A gene 
JOURNAL Biochem. Biophys. Res. Commun. 268 (2), 596-602 (2000) 
MEDLINE 20145478 
PUBMED 10679249 
REFERENCE 6 (bases 1 to 3106) 

AUTHORS Lin,C.S., Lau, A., Tu,R. and Lue, T . F . 

TITLE Expression of three isoforms of cGMP-binding cGMP-speci f ic 

phosphodiesterase (PDE5) in human penile cavernosum 
JOURNAL Biochem. Biophys. Res. Commun. 268 (2), 628-635 (2000) 
MEDLINE 20145484 
PUBMED 10679255 
REFERENCE 7 (bases 1 to 3106) 

AUTHORS Lin,C.S., Chow,S., Lau, A., Tu,R. and Lue , T . F . 
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TITLE Identification and regulation of human PDE5A gene promoter 

JOURNAL Biochem. Biophys . Res. Commun. 280 (3), 684-692 (2001) 
MEDLINE 21092663 
PUBMED 11162575 

COMMENT REVIEWED REFSEQ : This record has been curated by NCBI staff. The 

reference sequence was derived from AF043731 . 1 . 
On Oct 1, 2001 this sequence version replaced gi : 4505666 . 
Summary: This gene encodes a cGMP-binding, cGMP-specif ic 
phosphodiesterase, a member of the cyclic nucleotide 
phosphodiesterase family. This phosphodiesterase specifically 
hydrolyzes cGMP to 5'-GMP. It is involved in the regulation of 
intracellular concentrations of cyclic nucleotides and is important 
for smooth muscle relaxation in the cardiovascular system. 
Alternative splicing of this gene results in four transcript 
variants encoding distinct isoforms. 

Transcript Variant: This variant (1) encodes the longest isoform 
(1) of this protein. 
FEATURES Location/Qualifiers 
source 1 . .310 6 

/organism="Homo sapiens" 
/db_xref="taxon: 960 6" 
/ chromosome^ " 4 " 
/map="4q25-q27" 
gene 1 . . 310 6 

/gene="PDE5A" 

/note="synonyms: CN5A, PDE5 , PDE5A1 , CGB-PDE" 
/db_x re f = " Locus I D : 8654 " 
/db xref="MIM: 603310 " 
CDS 156.. 2783 

/gene="PDE5A" 

/ EC_numbe r= " 3.1.4.17 " 

/note="cGMP-binding cGMP-speci f ic 3', 5* -cyclic nucleotide 

phosphodiesterase " 

/codon_start=l 

/product="phosphodiesterase 5A isoform 1" 
/protein id=" NP 001074.1 " 
/db_xref="GI : 4505667" 
/db xref="LocusID: 8654 " 
/db xref="MIM: 603310 " 

/translation="MERAGPSFGQQRQQQQPQQQKQQQRDQDSVEAWLDDHWDFTFSY 
FVRKATREMVNAWFAERVHTI PVCKEGIRGHTESCSCPLQQSPRADNSVPGTPTRKIS 
ASEFDRPLRPIVVKDSEGTVSFLSDSEKKEQMPLTPPRFDHDEGDQCSRLLELVKDIS 
SHLDVTALCHKI FLHIHGLISADRYSLFLVCEDSSNDKFLISRLFDVAEGSTLEEVSN 
NCIRLEWNKGI VGHVAALGEPLNIKDAYEDPRFNAEVDQITGYKTQSILCMPIKNHRE 
EVVGVAQAINKKSGNGGTFTEKDEKDFAAYLAFCGIVLHNAQLYETSLLENKRNQVLL 
DLASLI FEEQQSLEVI LKKIAATIISFMQVQKCTI FIVDEDCSDSFSSVFHMECEELE 
KSSDTLTREHDANKINYMYAQYVKNTMEPLNI PDVSKDKRFPWTTENTGNVNQQCIRS 
LLCTPIKNGKKNKVIGVCQLVNKMEENTGKVKPFNRNDEQFLEAFVIFCGLGIQNTQM 
YEAVERAMAKQMVTLEVLSYHASAAEEETRE LQSLAAAVVPSAQTLKITDFSFSDFEL 
SDLETALCTIRMFTDLNLVQN FQMKHEVLCRWILSVKKNYRKNVAYHNWRHAFNTAQC 
MFAALKAGKIQNKLTDLEILALLIAALSHDLDHRGVNNSYIQRSEHPLAQLYCHSIME 
HHHFDQCLMILNSPGNQILSGLSIEEYKTTLKI IKQAILATDLALYIKRRGEFFELIR 
KNQFNLEDPHQKELFLAMLMTACDLSAITKPWPIQQRIAELVATEFFDQGDRERKELN 
IEPTDLMNREKKNKI PSMQVGFI DAICLQLYEALTHVSEDCFPLLDGCRKNRQKWQAL 
AEQQEKMLINGESGQAKRN" 
misc feature 645. .1118 

/gene="PDE5A" 

/note="GAF; Region: Domain present in phytochromes and 
cGMP-specif ic phosphodiesterases" 
/db_xref- "CDD : smart00065 " 
misc feature 645. .1097 

/gene="PDE5A" 

/note="GAF; Region: GAF domain. Domain present in 
phytochromes and cGMP-specif ic phosphodiesterases" 
/db xref="CDD: pfam01590 " 
misc feature 1191. .1694 

/gene="PDE5A" 
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misc feature 



misc feature 



variation 



BASE COUNT 916 , 

ORIGIN 

1 gcggccgcgc 

61 gggtctcgag 

121 gtaccagaga 

181 ggcagcagcg 

241 cggtcgaagc 

301 ccaccagaga 

361 aggaaggtat 

421 cagataacag 

481 ctcttagacc 

541 aaaagaagga 

601 gctcaagact 

661 gtcacaaaat 

721 ttgtctgtga 

781 aaggttcaac 

841 ttgtgggaca 

901 ctcggttcaa 

961 tgccaattaa 

1021 caggaaacgg 

1081 tttgtggtat 

1141 gaaatcaggt 

1201 taattttgaa 

1261 ttttcatagt 

1321 aggaattaga 

1381 acatgtatgc 

1441 aggataaaag 

1501 gaagtttgct 

1561 aacttgttaa 

1621 aacagtttct 

1681 atgaagcagt 

1741 atgcttcagc 

1801 catctgccca 

1861 tggaaacagc 

1921 agatgaaaca 

1981 atgttgccta 

2041 taaaagcagg 

2101 ctgcactaag 

2161 aacatccact 

2221 gcctgatgat 

2281 ataagaccac 

2341 ttaagaggcg 

2401 ctcatcaaaa 

2461 caaaaccctg 

2521 aaggagacag 

2581 agaaaaacaa 

2641 atgaggccct 

2701 acaggcagaa 

2761 gcggccaggc 

2821 ggtgtgttct 



/note="GAF; Region: Domain present in phytochromes and 

cGMP-specif ic phosphodiesterases" 

/db xref="CDD: smart00065 " 

1191. .1664 

/gene="PDE5A M 

/note="GAF; Region: GAF domain. Domain present in 
phytochromes and cGMP-specif ic phosphodiesterases" 
/db xref="CDD: pfam01590 " 
1989. .2705 
/gene="PDE5A" 

/note=" PDEase ; Region: 3' 5' -cyclic nucleotide 

phosphodiesterase" 

/db_xre f ="CDD : pfam00233 " 

complement (433) 

/allele="G" 

/allele="A" 

/db xref="dbSNP: 3733526 " 
a 625 c 



tccggccgct 
gccgagtcct 
gcctccgagg 
acagcagcag 
atggctggac 
aatggtcaat 
cagaggccac 
tgtccctgga 
cattgttgtc 
acagatgcct 
cttggaatta 
tttcttgcat 
agacagctcc 
actggaagaa 
tgtggcagcg 
tgcagaagtt 
gaatcatagg 
tgggacattt 
tgttcttcat 
gctgcttgac 
gaaaatagct 
ggatgaagat 
aaaatcatct 
tcagtatgtc 
atttccctgg 
ttgtacacct 
taagatggag 
ggaagctttt 
ggagagagcc 
agcagaggaa 
gacccttaaa 
actgtgtaca 
tgaggttctt 
tcataattgg 
caaaattcag 
ccacgatttg 
tgcccagctt 
tcttaatagt 
gttgaaaata 
aggagaattt 
ggagttgttt 
gcctattcaa 
agagagaaaa 
aatcccaagt 
gacccacgtg 
atggcaggcc 
caagcggaac 
gcaatatgcc 



732 g 

ttgtcgaaag 
gttcttctga 
cgcgcggtgc 
cagccccagc 
gatcactggg 
gcatggtttg 
accgaatctt 
acaccaacca 
aaggattctg 
ctaacccctc 
gtgaaggata 
atccatggac 
aatgacaagt 
gtttcaaata 
cttggtgagc 
gaccaaatta 
gaagaggttg 
actgaaaaag 
aatgctcagc 
cttgctagtt 
gccactatta 
tgctccgatt 
gatacattaa 
aaaaatacta 
acaactgaaa 
ataaaaaatg 
gagaatactg 
gtcatctttt 
atggccaagc 
gaaacaagag 
attactgact 
attcggatgt 
tgcagatgga 
agacatgcct 
aacaagctga 
gatcaccgtg 
tactgccatt 
ccaggcaatc 
atcaagcaag 
tttgaactta 
ttggcaatgc 
caacggatag 
gaactcaaca 
atgcaagttg 
tcagaggact 
cttgcagaac 
tgagtggcct 
tagtttctta 



833 t 

ccggcccgac 
gggacggacc 
caaccatgga 
agcagaagca 
actttacctt 
ctgagagagt 
gctcttgtcc 
ggaaaatctc 
agggaactgt 
caaggtttga 
tttctagtca 
tgatatctgc 
ttcttatcag 
actgtatccg 
ccttgaacat 
caggctacaa 
ttggtgtagc 
atgaaaagga 
tctatgagac 
taatttttga 
tctctttcat 
ctttttctag 
caagggaaca 
tggaaccact 
atacaggaaa 
gaaagaagaa 
gcaaggttaa 
gtggcttggg 
aaatggtcac 
agctacagtc 
ttagcttcag 
ttactgacct 
ttttaagtgt 
ttaatacagc 
ctgacctgga 
gtgtgaataa 
caatcatgga 
agattctcag 
ctattttagc 
taagaaaaaa 
tgatgacagc 
cagaacttgt 
tagaacccac 
ggttcataga 
gtttcccttt 
agcaggagaa 
atttcatgca 
cacactgtct 



tggagcagga 
ccagctgggg 
gcgggccggc 
gcagcagagg 
ctcatacttt 
tcacaccatc 
cttgcagcag 
tgcctctgaa 
gagcttcctc 
tcatgatgaa 
tttggatgtc 
tgaccgctat 
ccgcctcttt 
cttagaatgg 
caaagatgca 
gacacaaagc 
ccaggccatc 
ctttgctgct 
ttcactgctg 
agaacaacaa 
gcaagtgcag 
tgtgtttcac 
tgatgcaaac 
taatatccca 
tgtaaaccag 
taaagttata 
gcctttcaac 
gatccagaac 
attggaggtt 
gttagcggct 
tgactttgag 
caaccttgtg 
taagaagaat 
tcagtgcatg 
gatacttgca 
ctcttacata 
acaccatcat 
tggcctctcc 
tacagaccta 
tcaattcaat 
ttgtgatctt 
agcaactgaa 
tgatctaatg 
tgccatctgc 
gctagatggc 
gatgctgatt 
gagttgaagt 
gtatagtgtc 



cgaaggggga 
tggaaaagca 
cccagcttcg 
gatcaggact 
gttagaaaag 
cctgtgtgca 
agtcctcgtg 
tttgaccggc 
tctgactcag 
ggggaccagt 
acagccttat 
tccctgttcc 
gatgttgctg 
aacaaaggca 
tatgaggatc 
attctttgta 
aacaagaaat 
tatttggcat 
gagaacaaga 
tcattagaag 
aaatgcacca 
atggagtgtg 
aaaatcaatt 
gatgtcagta 
cagtgcatta 
ggggtttgcc 
cgaaatgacg 
acgcagatgt 
ctgtcgtatc 
gctgtggtgc 
ctgtctgatc 
cagaacttcc 
tatcggaaga 
tttgctgctc 
ttgctgattg 
cagcgaagtg 
tttgaccagt 
attgaagaat 
gcactgtaca 
ttggaagatc 
tctgcaatta 
ttttttgatc 
aacagggaga 
ttgcaactgt 
tgcagaaaga 
aatggggaaa 
ttacagagat 
tgtatttggt 
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2881 atatactttg ccactgctgt atttttattt 

2941 gtttttagag gactattaca tattttttgt 

3001 atcaacaaca tccactgtta gcacattgat 

3061 gcaaagtgta tgcagtattc ttgcactgag 



ttgcacaact tttgagagta tagcatgaat 
atatttgttt tatgctactg aactgaaagg 
aaaagcattg tttgtgatat ttcgtgtact 
gtttttttgc ttgggg 
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